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Field of the Invention 

This invention relates to the fields of genomics and pharmacogenetics. More specifically, 
the present invention relates to polymorphisms and haplotypes of the ^-adrenergic receptor gene 
and their use as predictors of disease susceptibility and response to p-agonists. 

Background of the Invention 

The p 2 -adrenergic receptor (P 2 AR) is a G protein coupled receptor that mediates the 
actions of catecholamines in multiple tissues and thus plays important roles in regulating cardiac, 
vascular, pulmonary, and metabolic functions. An abnormal level of expression of p 2 AR is 
believed to be a risk factor for or to modify the severity of a number of diseases and conditions, 
including congestive heart failure, arrhythmia, ischemic heart disease, hypertension, migraine, 
asthma, chronic obstructive pulmonary disease (COPD), anaphylaxis, obesity, diabetes, 
myasthenia gravis, and premature labor. 

The p 2 AR is encoded by an intronless gene on chromosome 5q3 1-32 (Kobilka, B.K. et 
nUProc.Natl.Acad.Sci., USA 84:46-50, 1987). Several single nucleotide polymorphisms (SNPs) 
in the coding block of the p 2 AR gene that lead to significant genetic variability in the structure of 
the p 2 AR protein in the human population have been reported (Reihsaus, E. et al., Am JResp Cell 
Mol Biol 8:334-339, 1993; Liggett, S.B., News in Physiologic Sciences 10:265-273, 1995; and 
GenBank accession numbers AF022953.1 GI:2570526; AF022954.1 GL2570528; and 
AF022956.1 GL2570532). These SNPs are located at nucleotides 46 (A or G), 79 (C or G), and 
491 (C or T) of the p 2 AR coding sequence, and result in variation that occurs in the amino- 
terminus of the receptor at amino acids 16 (Arg or Gly) and 27 (Gin or Glu) and in the fourth 
transmembrane spanning domain at amino acid 164 (Thr or lie), respectively. These amino acid 
variants have clear phenotypic differences as demonstrated by recombinant cell studies (Green, 
S.A. et al., Biochem 33:9414-9419, 1994; Green, S.A., et al., J Biol Chem 268:23116-23121, 
1993), primary cultures of cells endogenously expressing these variants (Green, S.A., et al., Am J 
Resp Cell Mol Biol 13 :25-33, 1995), and transgenic mice overexpressing the Thrl64 or Ilel64 
receptors in the heart (Turki et al., Proc. Natl. Acad. Sci.. USA 93:10483-10488, 1996). In 
addition, a synonymous polymorphism of C or A at nucleotide 523 in the coding sequence has 
been reported to be associated with altered responsiveness to salbutamol in Japanese families 
(Ohe, M. et al., Thorax 50:353-359, 1995). 



In addition to the above polymorphisms in the coding block, several SNPs in the 5' 
promoter region have recently been identified and are located at nucleotides -1023 (A or G), -654 
(G or A), -468 (C or G), -367 (T or C), -47 (C or T) and -20 (T or C) (Scott, M.G.H. et al., Br J 
Pharmacol 126:841-844, 1999). Thus, eleven polymorphic sites have previously been identified 
in the region of the p 2 AR gene located between nucleotides 565 and 21 10 of GenBank Accession 
No. M15169.1 (see Fig. 1 (SEQ ID NO:l)). 

Messenger RNA transcripts of the p 2 AR gene have a 5 ' leader region harboring a short 
open reading frame (ORF), termed the p 2 AR 5'-leader cistron (5'LC), that encodes a 19 amino 
acid peptide (Kobilka, B.K. et A., J.Biol.Chem.: 262:7321-7327, 1996). This p 2 AR upstream 
peptide (BUP) modulates translation of p 2 AR mRNA, and thereby regulates cellular expression 
of the receptor (Parola, A.L. et al., J Biol Chem 269:4497-4505, 1994). The polymorphic site 
located at -47, described above, is in this 5'LC and results in either Arg or Cys being encoded at 
me terminal ammo acid (position 19) of the BUP. It was recently reported that the Cysl9 variant 
of the BUP is associated with greater p 2 AR expression than the Argl9 BUP variant (McGraw et 
si., J. Clin. Invest. 102:1927-1932, 1998). 

Several groups have suggested associations between some of the above p 2 AR amino acid 
variants and increased susceptibility to various conditions, including: high blood pressure (Gly 1 6 
variant, Hoit et al., Am Heart ./ 139:537-542, 2000; Gratze et al., Hypertension 33:1425-1430, 
1999 and Kotanko, P. et al., Hypertension 30:773-776, 1997; cf. Argl6 variant, Busjahn et al., 
Hypertension 35:555-560, 2000); atopy (Glyl6 variant, Dewar et al., Clin. Exp. Allergy 28:442- 
448, 1998); nocturnal asthma (Glyl6 variant, Turki et al., J. Clin. Invest. 95:1635-1641, 1995); 
response to treatment for obesity (Glyl6 variant, Sakane et al., Lancet 353:1916, 1999); 
myasthenia gravis (Argl6 variant, Xu, B.-Y. et al., C//«. & Exp. Immunol. 119:156-160,2000); 
childhood asthma (Gln27 variant, Dewar et al., J. Allergy Clin. Immun. 100:261-265, 1997); 
obesity (Glu27 variant, Large et al., J. Clin. Invest. 100:3005-3013, 1997); and mortality from 
congestive heart failure (Ilel64 variant, Liggett et al., J. Clin. Invest. 102:1534-1539, 1998). 

. Several of the polymorphic sites (PS) in the p 2 AR gene have been reported to be in 
linkage disequilibrium with each other, including between the +46 and +79 PS (Martinez et al., J. 
Clin. Invest. 100:261-265, 1997; Dewar et al., supra), between the -47, +46 and +79 PS 
) (McGraw et al., supra), between the -47, -20, +46, and +79 PS (Yamada et al., J. Clin. 

Endocrinol. Metab. 84:1754-1757, 1999), and between the +79 and +523 PS (Dewar et al., Clin, 
and Exp. Allergy 28:442-448, 1998). In addition, associations between various in vivo 
phenotypes and haplotypes for various combinations of these polymorphic sites have been 
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suggested: obesity and a -47 C/-20 C haplotype (Y amada et al., supra), asthma severity and a 
+46G/+79G haplotype (which encodes theGlyl6/Gln27 variant) (Weir et al., Am J. Resp. Crit 
Care Med. 158:787-791, 1998); bronchial hyperresponsiveness (BHR) and a +46 G/+79 G 
haplotype (D'amato et al.,^m. J. Resp. Crit Care Med 158:1968-1973, 1998); hypertension and 
a -47 T/+46 A/+79 G haplotype, and the following expanded version thereof: 
-1023 G/-654 A/-47 T/-20 C/+46 A/+79 C (Timmeiman et al. Kidney Int. 53:1455-1460, 1998; 
WO 99/37761). An association between reduced P2AR promoter activity in vitro and a 
haplotype of -468 G/-367 C/-47 C/-20 C has also been reported (Scott et al., Br. J. Pharmacol. 
126:841-844, 1999). However, no haplotypes covering more than six of the above 11 sites have 
been reported. 

It has also been suggested that some of the P2AR gene polymorphisms discussed above 
may act as disease modifiers in asthma or may be the basis for the known interindividual 
variation in the bronchodilating response to p-agonists (Liggett, S.B. 'The genetics of P2- 
adrenergic receptor polymorphisms: relevance to receptor function and asthmatic phenotypes." 
in: Liggett, S.B. & Meyers, D.A., The Genetics of Asthma (1996) pp. 455-478). Indeed, two 
groups have reported that individuals homozygous or heterozygous for the Argl6 variant are 
more likely to respond to albuterol than individuals homozygous for the Glyl6 variant. 
(Martinez, F.D. et al., J Clin Invest 100:3184-3188, 1997 and Lima, J.J., et al., Clin Pharmacol 
Ther 65:5 19-525, 1999). It has also been reported that asthmatic individuals who are 
homozygous for the Argl6 variant are more likely to exhibit decreased response to repeated use 
of albuterol (Drazen et al., WO 98/39477). Interestingly, another group reported bronchodilator 
desensitization in asthmatics homozygous for the Glyl6 variant following continuous therapy 
with the P-agonist formoterol (Tan et al., Lancet 350:995-999, 1997). Other studies failed to 
demonstrate any correlations between adverse drug response and regular treatment with p- 
agonists (Hancox, RJ. et al., Eur Respir J 11 :589-593, 1998; Lipworth, B.J. et al., Clinical 
Science 96:253-259, 1999). Moreover, none of the human physiologic studies assessed the 
relevance of haplotypes of multiple P2AR polymorphisms in both the promoter and coding 
regions for predicting the bronchodilator response to p-agonists. Also, the phylogeny of these 
haplotypes and their distribution amongst different ethnic groups, which has particular relevance 
to pharmacogenetics, has not been explored. 

Because of the potential for individual polymorphisms and haplotypes in the P2AR gene 
to affect susceptibility to a number of diseases, as well as affect response to P-agonist therapy, it 
would be useful to determine whether additional polymorphisms exist in the P2AR gene, how 
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such polymorphisms are combined in different copies of the gene to (haplotypes), and whether 
the frequencies of such polymorphisms and haplotypes vary among different ethnic groups. Such 
information would be useful for studying the biological function of P2AR as well as in 
identifying drugs targeting P2AR for the treatment of disorders related to its abnormal expression 
or function. 

Summary of the Invention 

Accordingly, the inventors herein have discovered two novel polymorphic sites in the 
P2AR gene. These polymorphic sites (PS) correspond to the nucleotide positions 879 and 1182 
in the promoter region of the P2AR gene (see Fig. 1) and are designated PS2 and PS5, 
respectively, to reflect their order in the P2AR gene relative to the other 1 1 polymorphic sites 
(Table 3). The polymorphisms at these sites are cytosine or adenine at PS2 and cytosine or 
thymine at PS5. It is believed that p2AR-encoding polynucleotides containing one or more of 
these novel polymorphic sites will be useful in studying the expression and biological function of 
P2AR, as well as in developing drugs targeting this receptor. 

In addition, the inventors have determined the identity of the alternative nucleotides 
present at these sites, and at the previously identified 1 1 polymorphic sites described above (see 
Table 3), in a human reference population of apparently normal unrelated individuals 
representing four major population groups and in a cohort of asthma patients: The inventors 
herein have also identified how the polymorphisms at these 1 3 polymorphic sites in the P2AR 
gene (see Figure 1) are combined into haplotypes in the reference and patient populations (see 
Tables 4 and 5) and discovered that certain pairs of these haplotypes, designated 2/2, 2/4, 2/6, 4/4 
and 4/6 are predictive of bronchodilator response to albuterol. Asthmatic patients having P2AR. 
haplotype pairs 2/2 and 4/6 respond well to albuterol while patients with P2AR haplotype pairs 
21 A and 4/4 exhibit little to no response. P2AR haplotype pair 2/6 is associated with a moderate 
bronchodilator response. The inventors herein have also discovered that the presence of one of 
these medically significant haplotype pairs in an asthma patient may be predicted with high 
confidence by genotyping only three sites: PS 3, PS9 and PS1 1 . 

Thus, in one embodiment, the invention provides an isolated polynucleotide comprising a 
nucleotide sequence which is a polymorphic variant of a reference sequence for the P2AR gene or 
a fragment thereof. The reference sequence comprises SEQ ID NO:l and the polymorphic 
variant comprises at least one polymorphism selected from the group consisting of adenine at 
PS2 and thymine at PS5. A particularly preferred polymorphic variant is a naturally-occurring 



isoform (also referred to herein as an "isogene") of the P2AR gene. A P2AR isogene of the 
invention comprises guanine or adenine at PS1, cytosine or adenine at PS2, guanine or adenine at 
PS3, guanine or cytosine at PS4, cytosine or thymine at PS5, cytosine or thymine at PS6, 
cytosine or thymine at PS7, thymine or cytosine at PS 8, adenine or guanine at PS9, cytosine or 
guanine at PS10, guanine or adenine at PSl 1, cytosine or thymine at PS12, and cytosine or 
adenine at PSl 3. The invention also provides a collection of P2AR isogenes, referred to herein as 
a P2AR genome anthology. > 

A P2AR isogene may be defined by the combination and order of these polymorphisms in 
the isogene, which is referred to herein as a P2AR haplotype.. Thus, the invention also provides 
data on the number of different P2AR haplotypes found in the above four population groups. 
This haplotype data is useful in methods for deriving a P2AR haplotype from an individual's 
genotype for the P2AR gene and for determining an association between a P2AR haplotype and a 
particular trait. 

In another embodiment, the invention provides a recombinant expression vector 
comprising one of the polymorphic P2AR genomic variants operably linked to expression 
regulatory elements as well as a recombinant host cell transformed or transfected with the 
expression vector. The recombinant vector and host cell may be used to express P2AR for 
protein structure analysis and drug binding studies. 

The invention also provides methods, compositions, and kits for haplotyping and/or. 
geno typing the P2AR gene in an individual. In one embodiment, the geno typing method 
comprises isolating from the individual a nucleic acid mixture comprising the two copies of the 
P2AR gene present in the individual and determining the identity of the nucleotide pair at one or 
both of PS2 and PS5 in the two copies to assign a P2AR genotype to the individual. In another 
embodiment, a method for predicting an individual's haplotype pair comprises determining the 
individual's genotype at PS3, PS9 and PSl 1 . The compositions contain oligonucleotide probes or 
primers designed to specifically hybridize to one or more target regions containing, or that are 
adjacent to, one or both of PS2 and PS5. Kits of the invention comprise a set of oligonucleotides 
for genotyping at least PS3, PS9 and PSl 1 and may also comprise additional oligonucleotides for 
genotyping one or more additional polymorphic sites selected from the group consisting of PSl, 
PS2, PS4, PS5, PS6, PS7, PS8, PS10, PS12 and PS13. The methods and compositions for 
genotyping or haplotyping PS2 and PS5 in the P2AR gene are useful for studying the effect of the 
alternative nucleotides at PS2 and PS5 in the etiology of various diseases and efficacy of drugs 



targeting the p 2 AR. Methods and kits for genotyping and haplotyping PS3, PS9 and PS 1 1 are 
useful for predicting an asthmatic patient's bronchodilating response to p-agonists. 

In yet another embodiment, the invention provides a method for identifying an 
association between a P2AR haplotype and a trait. In preferred embodiments, the trait is 
susceptibility to a disease, disease severity, the staging of a disease or response to a drug. Such 
methods have applicability in developing diagnostic tests and therapeutic treatments for one or 
more conditions selected from the group consisting of congestive heart failure, arrhythmia, 
ischemic heart disease, hypertension, migraine, asthma, chronic obstructive pulmonary disease 
(COPD), anaphylaxis, obesity, diabetes, myasthenia gravis (MG) and premature labor. In other 
preferred embodiments, the drug is an agonist or antagonist of p 2 AR. 

The present invention also provides genetically modified animals comprising one or more 
of the novel P2AR genomic polymorphic variants described herein and methods for producing 
such animals. Such animals are useful for studying expression of the P2AR isogenes in vivo, for 
in vivo screening and testing of drugs targeted against P2AR protein, and for testing the efficacy 
of therapeutic agents and compounds targeting the p 2 AR in a biological system. 

The present invention also provides a computer system for storing and displaying 
polymorphism data determined for the P2AR gene. The computer system comprises a computer 
processing unit; a display; and a database containing the polymorphism data. The polymorphism 
data includes the polymorphisms, the genotypes and the haplotypes identified for the p 2 AR gene 
in one or both of the reference population and the patient population. In a preferred embodiment, 
the computer system is capable of producing a display showing P2AR haplotypes organized 
according to their evolutionary relationships. 

Another aspect of the invention is based on the discovery of novel information relating to 
linkage disequilibrium in Caucasians between PS1, PS3, PS4 and PS6 and the p 2 AR polymorphic 
sites previously reported to be associated with various medical conditions, i.e., PS9 and PS 10. 
Thus, the present invention also provides a method for predicting a Caucasian individual's 
genetic predisposition to any disease or condition known to be associated with one of the 
alternative alleles at PS9 or PS 10 in the P2AR gene. The method comprises determining a first 
genotype for a first polymorphic site in the individual's P2AR gene, wherein the first 
polymorphic site is selected from the group consisting of PS1, PS3, PS4 and PS6, and using the 
first genotype to predict a second genotype for a second polymorphic site in the individual's 
p 2 AR gene, wherein if the first polymorphic site is PS1 , PS4 or PS6, then the second 
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polymorphic site is PS 10, and if the first polymorphic site is PS3, then the second polymorphic 
site is PS 9. In a preferred embodiment, the disease or condition is selected from the group 
consisting of atopy, nocturnal asthma, childhood asthma, hypertension, obesity, response to 
treatment for obesity and MG. 

5 The present invention further provides methods for predicting an asthma patient's 

response to p -agonist therapy. In one embodiment, the method comprises determining the 
genotype for the patient at PS3, PS9 and PS1 1, wherein the patient is likely to exhibit a good 
response to a standard dose of the P-agonist if the patient is homozygous for guanine at each of 
PS3, PS9 and PS1 1 or. if the patient is heterozygous A/G at each of PS3, PS9 and PS1 1. If the 

10 patient is homozygous A/A/G at PS3, PS9 and PS1 1, respectively, then the patient is likely to not 
respond to standard dosages of the P-agonist. In a preferred embodiment, the p-agonist is 
albuterol. Thus, knowledge of a patient's (3 2 AR genotype for PS3, PS9 and PS1 1 provides a 
physician with information useful for making determinations as to which drug to administer and 
dosages of the drug. 



•g , 15 Brief Description of the Figures 

Figure 1 illustrates the DNA sequence for a reference sequence of the human B2AR gene 
(GenBank Accession No. M15169.1; SEQ ID NO:l), with the underlines indicating the start and 
stop codons, shading indicating the reference coding sequence, and bold nucleotides indicating 
the polymorphic sites and polymorphisms identified by Applicants in the reference and asthma. 
20 patient populations. 

Figure 2 illustrates a reference amino acid sequence for B2AR (contiguous lines; SEQ ED 
NO:2), with the bold amino acids indicating the amino acid variations caused by the 
polymorphisms at nucleotides 1633, 1666 and 2078 in the B2AR gene. 

Figure 3 illustrates the phylogeny of haplotypes of the B2AR gene, with each haplotype 
25 represented by a circle, the area of which representing the overall frequency of that haplotype in 
the reference population, and subdivisions representing the frequency of the haplotype in each of 
the four population groups. Differences between the haplotypes are indicated by the lines 
connecting the circles, with solid black lines for single-site differences, gray lines for two-site 
differences and dashed lines for more than two differences. 
30 Figure 4 illustrates the linkage disequihbrium between thirteen polymorphic sites in the 

6 2 AR gene with the degree of linkage indicated by shading. 
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Figure 5 shows a bar graph of the in vivo FEVi responses to albuterol exhibited by asthma 
patients having the indicated haplotype pairs. 

Figure 6 A shows a bar graph illustrating the amount of B 2 AR protein expression in 
HEK293 cells co-transfected with a vector containing the indicated B 2 AR isogene and a 
luciferase construct. 

Figure 6B shows a bar graph illustrating the amount of B 2 AR protein expression from Fig. 
6A after correction for transfection efficiency as quantitated by luciferase activity. 

Figure 6C shows a bar graph of p 2 AR mRNA expression in the transfected HEK293 cells 
of Fig. 6 A. 

Description of the Preferred Embodiments 

In accordance with the present invention, the inventors herein have discovered novel 
variants of the p 2 AR gene. As described in more detail below, the inventors herein discovered 
two novel polymorphic sites by characterizing an approximate 1.8 kb region of the p 2 AR gene 
found in genomic DNAs isolated from an asthmatic cohort (121 Caucasians and 13 African 
Americans) and an Index Repository that contains immortalized cell lines from 93 human 
individuals, 76 of which comprised a reference population of unrelated individuals self-identified 
as belonging to one of four major population groups: Caucasian (23 individuals), African descent 
(19 individuals), Asian (20 individuals) and Hispanic-Latino (15 individuals). In addition, the 
Index Repository contains three families: two three- generation Caucasian families from the 
CEPH-Utah cohort and one two-generation African- American family. 

Using the p 2 AR genotypes identified in the Index Repository and patient cohort, in 
conjunction with the methodology described in the Examples below, the inventors herein also 
determined the haplotypes found on each chromosome for most individuals in these populations, 
and determined the frequencies of these haplotypes in the four major population groups. The " 
P2AR genotypes and haplotypes found in the Index Repository and patient cohort include those 
shown in Table 4 and 5, respectively. It is believed the p 2 AR polymorphism and haplotype data 
disclosed herein are useful for studying population diversity, anthropological lineage, the 
significance of diversity and lineage at the phenotypic level, paternity testing, forensic 
applications, and for identifying associations between p 2 AR genetic variation and a trait such as 
level of drug response or susceptibility to disease. 
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As disclosed in more detail below, certain pairs of these P2AR haplotypes, designated 2/2, 
2/4, 2/6, 4/4 and 4/6 are predictive of bronchodilator response to albuterol. Thus, the present 
invention is useful in prescribing P-agonists for treating bronchospasm. 

In the context of this disclosure, the following terms shall be defined as follows unless 
otherwise indicated: 

Allele - A particular form of a genetic locus, distinguished from other forms by its 
particular nucleotide or amino acid sequence. x 

Gene - A segment of DNA that contains all the information for the regulated biosynthesis 
of an RNA product, including promoters, exons, introns, and other untranslated regions that 
control expression. 

Genotype - An unphased 5 ' to 3 ' sequence of nucleotide pair(s) found at one or more 
polymorphic sites in a locus on a pair of homologous chromosomes in an individual. 

Full-genotype - The unphased 5 ' to 3 ' sequence of nucleotide pairs found at all known 
polymorphic sites in a locus on a pair of homologous chromosomes in a single individual. 

Sub-genotype - The unphased 5' to 3' sequence of nucleotides seen at a subset of the 
known polymorphic sites in a locus on a pair of homologous chromosomes in a single individual. 

Genotyping - A process for determining a genotype of an individual. 

Haplotype - A phased 5' to 3 ' sequence of nucleotides found at two or more 
polymorphic sites in a locus on a single chromosome from a single individual. 

Haplo typing - A process for determining a haplotype of an individual. 

Haplotype pair - The two haplotypes found for a locus in a single individual. 

Full-haplotypei — The 5' to 3' sequence of nucleotides found at all known polymorphic 
sites in a locus on a single chromosome from a single individual. 

Sub-haplotype - The 5' to 3' sequence of nucleotides seen at a subset of the known 
polymorphic sites in a locus on a single chromosome from a single individual. 

Haplotype data - Information concerning one or more of the following for a specific 
gene: a listing of the haplotype pairs in each individual in a population; a listing of the different 
haplotypes in a population; frequency of each haplotype in that or other populations, and any 
known associations between one or more haplotypes and a trait. 

Isoform - A particular form of a gene, mRNA, cDNA or the protein encoded thereby, 
distinguished from other forms by its particular sequence and/or structure. 

Isogene - One of the isoforms of a gene found in a population. An isogene contains all of 
the polymorphisms present in the particular isoform of the gene. 
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Isolated - As applied to a biological molecule such as RNA, DNA, oligonucleotide, or 
protein, isolated means the molecule is for practical purposes free of other biological molecules 
such as non-desired nucleic acids, proteins, lipids, carbohydrates, or other material such as 
cellular debris and growth media. Generally, the term "isolated" is not intended to refer to a 
5 complete absence of such material or to absence of water, buffers, or salts, unless they are 
present in amounts that substantially interfere with the methods of the present invention. 

Locus - A location on a chromosome or DNA molecule corresponding to a gene or a 
physical or phenotypic feature. 

Naturally-occurring - A term used to designate that the object it is applied to, e.g., 
10 naturally-occurring polynucleotide or polypeptide, can be isolated from a source in nature and 
which has not been intentionally modified by man. 

Nucleotide pair - The nucleotides found at a polymorphic site on corresponding strands 
of the two copies of a chromosome in an individual. 
; ;t 7 Phased - As applied to a sequence of nucleotide pairs for two or more polymorphic sites 

ry 15 in a locus, phased means the combination of nucleotides present at those polymorphic sites on a 

single copy of the locus is known. . 
^ Polymorphic site (PS) - A position within a locus at which at least two alternative 

y sequences are found in a population. 

?p Polymorphic variant - A gene, mRNA, cDNA, polypeptide or peptide whose nucleotide 

1 1 20 or amino acid sequence varies from a reference sequence due to the presence of a polymorphism 
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in the gene. 

Polymorphism - The sequence variation observed in an individual at a polymorphic site. 
Polymorphisms include nucleotide substitutions, insertions, deletions and microsatellites and 
may, but need not, result in detectable differences in gene expression or protein function. 
25 Polymorphism Database - A collection of polymorphism data arranged in a systematic 

or methodical way and capable of being individually accessed by electronic or other means. 

Polynucleotide - A nucleic acid molecule comprised of single-stranded RNA or DNA or 
comprised of complementary, double-stranded DNA. 

Reference Population - A group of subjects or individuals who are predicted to be 
30 representative of the genetic variation found in the general population. In preferred 

embodiments, the reference population represents the genetic variation in the population at a 
certainty level of at least 85%, preferably at least 90%, more preferably at least 95% and even 
more preferably at least 99%. 
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Single Nucleotide Polymorphism (SNP) - Typically, the specific pair of nucleotides 
observed at a single polymorphic site. In rare cases, three or four nucleotides maybe found. 

Subject - A human individual whose genotypes or haplotypes or response to treatment or 
disease state are to be determined. 

Treatment - A stimulus administered internally or externally to an individual. 

Population Group - A group of individuals sharing a common ethnogeographic origin. 

Unphased - As applied to a sequence of nucleotide pairs for two or more polymorphic 
sites in a locus, unphased means the combination of nucleotides present at those polymorphic 
sites on a single copy of the locus is not known. 

The inventors herein have discovered two novel polymorphic sites in the P2AR gene, 

r 

which are referred to as of PS2 and.PSS to designate the order in which they are located in the 
gene (see Figure 1 and Table 3 below). PS1, PS3, PS4, PS5, PS6, PS7, PS8, PS9, PS10, PS11, 
PS12, and PS13 designate the previously identified polymorphic sites located at nucleotides 565, 
934, 1120, 1221, 1541, 1568, 1633, 1666, 1839, 2078 and 21 10 in Figure 1. 

Thus, in one embodiment, the invention provides an isolated polynucleotide comprising a 
polymorphic variant of the P2AR gene or. a fragment of the gene which contains at least one of 
the novel polymorphic sites described herein. The nucleotide sequence of a variant P2AR gene is 
identical to the reference genomic sequence for the region of the gene examined, as described in 
the Examples below, except that it comprises a different nucleotide at one or both of polymorphic 
sites PS2 and PS5. Similarly, the nucleotide sequence of a variant fragment of the P2AR gene is 
identical to the corresponding portion of the reference sequence except for having a different 
nucleotide at one or more of these polymorphic sites. Thus, the invention specifically does not 
include polynucleotides comprising a nucleotide sequence identical to the reference sequence (or 
other reported P2AR sequences) or to portions of the reference sequence (or other reported P2AR 
sequences), except for genotyping oligonucleotides as described below. The location of a 
polymorphism in a variant gene or fragment is identified by aligning its sequence with SEQ ID 
NO:l. The polymorphism is selected from the group consisting of cytosine at PS2 and cytosine 
at PS5. In a preferred embodiment, the polymorphic variant comprises a naturally-occurring 
isogene of the P2AR gene which is defined by any one of haplotypes 1-12 shown in Table 5 
below. 

Polymorphic variants of the invention may be prepared by isolating a clone containing the 
P2AR gene from a human genomic library. The clone may be sequenced to determine the 
identity of the nucleotides at the polymorphic sites described herein. Any particular variant 
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claimed herein could be prepared from this clone by performing in vitro mutagenesis using - 
procedures well-known in the art. Alternatively, a polymorphic variant of the p 2 AR gene may be 
chemically synthesized. 

(3 2 AR isogenes may be isolated using any method that allows separation of the two 
"copies" of the p?AR gene present in an individual, which, as readily understood by the skilled 
artisan, may be the same allele or different alleles. Separation methods include targeted in vivo 
cloning (TIVC) in yeast as described in WO 98/01573, U.S. Patent No. 5,866,404, and copending 
U.S. application Serial No. 08/987,966. Another method, which is described in copending U.S. 
Application Serial No. 08/987,966, uses an allele specific oligonucleotide in combination with 
primer extension and exonuclease degradation to generate, hemizygous DNA targets. Yet other 
methods are single molecule dilution (SMD) as described in Ruaiio et al., Proc. Natl. Acad. Sci. 
87:6296-6300, 1990; and allele specific PCR (Ruaiio et al., 17 Nucleic Acids. Res. 8392, 1989; 
Ruaiio et al., 19 Nucleic Acids Res. 6877-6882, 1991; Michalatos-Beloin et al., 24 Nucleic Acids 
Res. 4841-4843, 1996). 

The invention also provides P2AR genome anthologies, which are collections of P2AR 
isogenes found in a given population. The population may be any group of at least two 
individuals, including but not limited to a reference population, a population group, a family 
population, a clinical population, and a same sex population. A P2AR genome anthology may 
comprise individual P2AR isogenes stored in separate containers such as microtest tubes, separate 
wells of a micro titre plate and the like. Alternatively, two or more groups of the P2AR isogenes 
in the anthology may be stored in separate containers. Individual isogenes or groups of isogenes 
in a genome anthology may be stored in any convenient and stable form, including but not. . 
limited to in buffered solutions, as DNA precipitates, freeze-dried preparations and the like. A 
preferred P2AR genome anthology of the invention comprises a set of isogenes defined by the 
haplotypes shown in Table 5 below. 

An isolated polynucleotide containing a polymorphic variant nucleotide sequence of the 
invention may be operably linked to one or more expression regulatory elements in a 
recombinant expression vector capable of being propagated and expressing the encoded P2AR 
protein in a prokaryotic or a eukaryotic host cell. Examples of expression regulatory elements 
which may be used include, but are not limited to, the lac system, operator and promoter regions 
of phage lambda, yeast promoters, and promoters derived from vaccinia virus, adenovirus, 
retroviruses, or SV40. Other regulatory elements include, but are not limited to, appropriate 
leader sequences, termination codons, polyadenylation signals, and other sequences required for 
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the appropriate transcription and subsequent translation of the nucleic acid sequence in a given 
host cell. Of course, the correct combinations of expression regulatory elements will depend on 
the host system used. In addition, it is understood that the expression vector contains any 
additional elements necessary for its transfer to and subsequent replication in the host cell. 
Examples of such elements include, but are not limited to, origins of replication and selectable 
markers. Such expression vectors are commercially available or are readily constructed using 
methods known to those in the art (e.g., F. Ausubel et al., 1987, in "Current Protocols in 
Molecular Biology", John Wiley and Sons, New York, New York). Host cells which may be 
used to express the variant P2AR sequences of the invention include, but are not limited to, 
eukaryotic and mammalian cells, such as animal, plant, insect and yeast cells, and prokaryotic 
cells, such as E. coli, or algal cells as known in the art. The recombinant expression vector may 
be introduced into the host cell using any method known to those in the art including, but not 
limited to, microinjection, electroporation, particle bombardment, transduction, and transfection 
using DEAE-dextran, lipofection, or calcium phosphate (see e.g., Sambrook et al. (1989) in 
"Molecular Cloning. A Laboratory Manual", Cold Spring Harbor Press, Plainview, New York). 
In a preferred aspect, eukaryotic expression vectors that function in eukaryotic cells, and 
preferably mammalian cells, are used. Non-limiting examples of such vectors include vaccinia 
virus vectors, adenovirus vectors, herpes virus vectors, and baculo virus transfer vectors. 
Preferred eukaryotic cell lines include COS cells, CHO cells, HeLa cells, NIH/3T3 cells, and 
embryonic stem cells (Thomson, J. A. et al., 1998 Science 282:1 145-1 147). Particularly 
preferred host cells are mammalian cells. 

In describing the polymorphic sites identified herein, reference is made to the sense strand 
of the gene for convenience. However, as recognized by the skilled artisan, nucleic acid 
molecules containing the P2AR gene may be complementary double stranded molecules and thus 
reference to a particular site on the sense strand refers as well to the corresponding site on the 
complementary antisense strand. Thus, reference may be.made to the same polymorphic site on 
either strand and an oligonucleotide may be designed to hybridize specifically to either strand at 
a target region containing the polymorphic site. Thus, the invention also includes single-stranded 
polynucleotides which are complementary to the sense strand of the P2AR genomic variants 
described herein. 

Polynucleotides comprising a polymorphic gene variant or fragment may be useful for 
therapeutic purposes. For example, where a patient could benefit from expression, or increased 
expression, of a particular P2AR protein isoform, an expression vector encoding the isoform may 
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be administered to the patient. The patient may be one who lacks the P2AR isogene encoding 
that isoform or may already have at least one copy of that isogene. 

In other situations, it may be desirable to decrease or block expression of a particular 
p 2 AR isogene. Expression of a P2AR isogene may be turned off by transforming a targeted 
organ, tissue or cell population with an expression vector that expresses high levels of 
untranslatable mRNA for the isogene. Alternatively, oligonucleotides directed against the 
regulatory regions (e.g., promoter, introns, enhancers, 3 ' untranslated region) of the isogene may 
block transcription. Oligonucleotides targeting the transcription initiation site, e>g., between 
positions -10 and +10 from the start site are preferred. Similarly, inhibition of transcription can 
be achieved using oligonucleotides that base-pair with region(s) of the isogene DNA to form 
triplex DNA (see e.g., Gee et al. in Huber, B.E. and B.I. Carr, Molecular and Immunologic 
Approaches, Futura Publishing Co., Mt. Kisco, N.Y., 1994). Antisense oligonucleotides may 
also be designed to block translation of P2AR mRNA transcribed from a particular isogene. It is 
also contemplated that ribozymes may be designed that can catalyze the specific cleavage of 
P2AR mRNA transcribed from a particular isogene. 

The oligonucleotides may be delivered to a target cell or tissue by expression from a 
vector introduced into the cell or tissue in vivo or ex vivo. Alternatively, the oligonucleotides 
may be formulated as a pharmaceutical composition for administration to the patient. 
Oligoribonucleotides and/or oligodeoxynucleotides intended for use as antisense oligonucleotides 
may be modified to increase stability and half-life. Possible modifications include, but are not 
limited to phosphorothioate or 2' O-methyl linkages, and the inclusion of nontraditional bases 
such as inosine and queosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of 
adenine, cytosine, guanine, thymine, and uracil which are not as easily recognized by 
endogenous nucleases. " ■ _ - 

Effect(s) of the novel polymorphisms and haplotypes identified herein on expression of 
p 2 AR may be investigated by preparing recombinant cells and/or organisms, preferably 
recombinant animals, containing a polymorphic variant of the P2AR gene. As used herein, 
"expression" includes but is not limited to one or more of the following: transcription of the gene 
into precursor mRNA; splicing and other processing of the precursor mRNA to produce mature 
mRNA; mRNA stability; translation of the mature mRNA into P2AR protein (including codon 
usage and tRNA, availability); and glycosylation and/or other modifications of the translation 
product, if required for proper expression and function. 
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To prepare a recombinant cell of the invention, the desired P2AR isogene may be 
introduced into the cell in a vector such that the isogene remains extrachrornosomal. In such a 
situation, the gene will be expressed by the cell from the extrachrornosomal location. In a 
preferred embodiment, the P2AR isogene is introduced into a cell in such a way that it 
recombines with the endogenous P2AR gene present in the cell. Such recombination requires the 
occurrence of a double recombination event, thereby resulting in the desired P2AR gene 
polymorphism. Vectors for the introduction of genes both for recombination and for 
extrachrornosomal maintenance are known in the art, and any suitable vector or vector construct 
may be used in the invention. Methods such as electroporation, particle bombardment, calcium 
phosphate co-precipitation and viral transduction for introducing DNA into cells are known in 
the art; therefore, the choice of method may he with the competence and preference of the skilled 
practitioner.. Examples of cells into which the P2AR isogene may be introduced include, but are 
not limited to, continuous culture cells, such as COS, NIH/3T3, and primary or culture cells of 
the relevant tissue type, i.e., they are known to express P2AR, such as the human embryonic 
kidney cell line HEK293. Such recombinant cells can be used to compare the biological 
activities of the different protein variants. 

Recombinant organisms, i.e., genetically modified animals, expressing a variant P2AR 
gene are prepared using standard procedures known in the art. Preferably, a construct comprising 
the variant human gene is introduced into a nonhuman animal or an ancestor of the animal at an 
embryonic stage, i.e., the one-cell stage, or generally not later than about the eight-cell stage. 
Genetically-modified animals carrying the constructs of the invention can be made by several 
methods known to those having skill in the art. One method involves transfecting into the 
embryo a retrovirus constructed to contain one or more insulator elements, a gene or genes, of 
interest, and other components known to those skilled in the art to provide a complete shuttle 
vector; harboring the insulated gene(s) as a transgene, see e.g., U.S. Patent No. 5,610,053. 
Another method involves directly injecting a transgene into the embryo. A third method involves 
the use of embryonic stem cells. Preferably the genetic modification process results in 
replacement of the animal's P2AR gene with the human P2AR gene. Examples of animals into 
which the human P2AR isogenes may be introduced include, but are not limited to, mice, rats, 
other rodents, and nonhuman primates (see "The Introduction of Foreign Genes into Mice" and 
the cited references therein, In: Recombinant DNA, Eds. J.D. Watson, M. Gilman, J. Witkowski, 
and M. Zoller; W.H. Freeman and Company, New York, pages 254-272). Recombinant 
nonhuman animals stably expressing a human p 2 AR isogene and producing human P2AR protein 
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can be used as biological models for studying diseases related to abnormal P2AR expression 
and/or activity, and for screening and assaying various candidate drugs, compounds, and 
treatment regimens to reduce the symptoms or effects of these diseases. 

An additional embodiment of the invention relates to pharmaceutical compositions for 
treating disorders affected by expression or function of a novel P2AR isogene described herein. 
Such disorders include congestive heart failure, arrhythmia, ischemic heart disease, hypertension, 
migraine, asthma, chronic obstructive pulmonary disease (COPD), anaphylaxis, obesity, diabetes, 
myasthenia gravis, and premature labor. The pharmaceutical composition may comprise any of 
the following active ingredients: a polynucleotide comprising one of these novel P2AR isogenes; 
an antisense oligonucleotide directed against one of the novel P2AR isogenes, a polynucleotide 
encoding such an antisense oligonucleotide, or another compound which activates of inhibits 
expression of a novel P2AR isogene described herein. Preferably, the composition contains the 
active ingredient in a therapeutically effective amount. By therapeutically effective amount is 
meant that one or more of the symptoms relating to disorders related to the expression or function 
of a novel P2AR isogene is reduced and/or eliminated. The composition also comprises a 
pharmaceutically acceptable carrier, examples of which include, but are not limited to, saline, 
buffered saline, dextrose, and water. Those skilled in the art may employ a formulation most 
suitable for the active ingredient, whether it is a polynucleotide, oligonucleotide, protein, peptide 
or small molecule antagonist. The pharmaceutical composition may be administered alone or in 
combination with at least one other agent, such as a stabilizing compound. Administration of the 
pharmaceutical composition may be by any number of routes including, but not limited to oral, 
intravenous, intramuscular, infra-arterial, intramedullary, intrathecal, intraventricular, 
intradermal, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or 
rectal. . Further details on techniques for formulation and administration may be found in the . 
latest edition of Remington's Pharmaceutical Sciences (Maack Publishing Co., Easton, PA). 

For any composition, determination of the therapeutically effective dose of active 
ingredient and/or the appropriate route of administration is well within the capability of those 
skilled in the art. For example, the dose can be estimated initially either in cell culture assays or 
in animal models. The animal model may also be used to determine the appropriate 
concentration range and route of administration; Such information can then be used to determine 
useful doses and routes for administration in humans. The exact dosage will be determined by 
the practitioner, in light of factors relating to the patient requiring treatment, including but not 
limited to severity of the disease state, general health, age, weight and gender of the patient, diet, 
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time and frequency of administration, other drugs being taken by the patient, and 
tolerance/response to the treatment. 

Information on the identity of genotypes and haplotypes for the gene of any particular 
individual as well as the frequency of such genotypes and haplotypes in any particular population 
of individuals is expected to be useful for a variety of basic research and clinical applications. 
Thus, the invention also provides compositions and methods for detecting the novel P2AR 
polymorphisms and haplotypes identified herein. 

The compositions comprise at least one P2AR genotyping oligonucleotide. In one 
embodiment, a P2AR genotyping oligonucleotide is a probe or primer capable of hybridizing to a 
target region that is located close to, or that contains, one of the novel polymorphic sites 
described herein. As used herein, the term "oligonucleotide" refers to a polynucleotide molecule 
having less than about 100 nucleotides. A preferred oligonucleotide of the invention is 10 to 35 
nucleotides long. More preferably, the oligonucleotide is between 15 and 30, and most 
preferably, between 20 and 25 nucleotides in length. The oligonucleotide may be comprised of 
. any phosphorylation state of ribonucleotides, deoxyribonucleotides, and acyclic nucleotide 
derivatives, and other functionally equivalent derivatives. Alternatively, oligonucleotides may 
have a phosphate-free backbone, which may be comprised of linkages such as carboxymethyl, 
acetamidate, carbamate, polyamide (peptide nucleic acid (PNA)) and the like (Varma, R. in 
Molecular Biology and Biotechnology, A Comprehensive Desk Reference, Ed. R. Meyers, VCH 
Publishers, Inc. (1995), pages 617-620). Oligonucleotides of the invention may be prepared by 
chemical synthesis using any suitable methodology known in the art, or may be derived from a 
biological sample, for example, by restriction digestion.. The oligonucleotides may be labeled, 
according to any technique known in the art, including use of radiolabels, fluorescent labels, 
enzymatic labels, proteins, haptens, antibodies, sequence tags and the like. 

Genotyping oligonucleotides of the invention must be capable of specifically hybridizing 
to a target region of a P2AR polynucleotide, i.e., a P2AR isogene. As used herein, specific 
hybridization means the oligonucleotide forms an anti-parallel double-stranded structure with the 
target region under certain hybridizing conditions, while failing to form such a structure when 
incubated with a non-target region or a non~p 2 AR polynucleotide under the same hybridizing 
conditions. Preferably, the oligonucleotide specifically hybridizes to the target region under 
conventional high stringency conditions. The skilled artisan can readily design and test 
oligonucleotide probes and primers suitable for detecting polymorphisms in the P2AR gene using 
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the polymorphism information provided herein in conjunction with the known sequence 
information for the P2AR gene and routine techniques. 

A nucleic acid molecule such as an oligonucleotide or polynucleotide is said to be a 
"perfect" or "complete" complement of another nucleic acid molecule if every nucleotide of one 
of the molecules is complementary to the nucleotide at the corresponding position of the other 
molecule. A nucleic acid molecule is "substantially complementary" to another molecule if it 
hybridizes to that molecule with sufficient stability to remain in a duplex form under 
conventional low-stringency conditions. Conventional hybridization conditions are described, 
for example, by Sambrook J. et al., in Molecular Cloning, A Laboratory Manual, 2 nd Edition, 
Cold Spring Harbor Press, Cold Spring Harbor, NY (1989) and by Haymes, B.D. et al. in Nucleic 
Acid Hybridization, A Practical Approach, IRL Press, Washington, D.C. (1985). While perfectly 
complementary oligonucleotides are preferred for detecting polymorphisms, departures from 
complete complementarity are contemplated where such departures do not prevent the molecule 
from specifically hybridizing to the target region. For example, an oligonucleotide primer may 
have a non-complementary fragment at its 5 ' end, with the remainder of the primer being 
complementary to the target region. Alternatively, non-complementary nucleotides may be 
interspersed into the oligonucleotide probe or primer as long as the resulting probe or primer is 
still capable of specifically hybridizing to the target region. 

Preferred genotyping oligonucleotides of the invention are allele-specific 
oligonucleotides. As used herein, the term allele-specific oligonucleotide (ASO) means an 
oligonucleotide that is able, under sufficiently stringent conditions, to hybridize specifically to 
one allele of a gene, or other locus, at a target region containing a polymorphic site while not 
hybridizing to the corresponding region in another allele(s). As understood by the skilled artisan, 
allele-specificity will depend upon a variety of readily optimized stringency conditions, including 
salt and formamide concentrations, as well as temperatures for both the hybridization and 
washing steps. Examples of hybridization and washing conditions typically used for ASO probes 
are found in Kogan et al., "Genetic Prediction of Hemophilia A" in PCR Protocols, A Guide to 
Methods and Applications, Academic Press, 1990 and Ruano et al., 87 Proc. Natl. Acad. Sci. 
USA 6296-6300, 1990. Typically, an allele-specific oligonucleotide will be perfectly 
complementary to one allele while containing a single mismatch for another allele. 

Allele-specific oligonucleotide probes which usually provide good discrimination between 
different alleles are those in which a central position of the oligonucleotide probe aligns with the 
polymorphic site in the target region (e.g., approximately the 7 th or 8 th position in a 15 mer, the 
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8 th or 9 th position in a 16mer, the 1 0 th or 1 1 th position in a 20 mer). A preferred ASO probe for 
detecting p 2 AR gene polymorphisms at PS2 and PS5 comprises a nucleotide sequence selected 
from the group consisting of: 

5 '-TGCATGTCGGTGAGC-3 ' (SEQ ID NO:3) and its complement; 

5 '-TGCATGTAGGTGAGC-3 ' (SEQ ID NO:4) and its complement; 

5 '-GGTGGCCCGCCCTCC-3 ' (SEQ ID NO:5) and its complement; and 

5 '-GGTGGCCTGCCCTCC-3 ' (SEQ ID NO:6) and its complement. 

An allele- specific oligonucleotide primer of the invention has a 3' terminal nucleotide, or 
preferably a 3 ' penultimate nucleotide, that is complementary to only one nucleotide of a 
particular SNP, thereby acting as a primer for polymerase-mediated extension only if the allele 
containing that nucleotide is present. Allele-specific oligonucleotide primers hybridizing to 
either the coding or noncoding strand are contemplated by the invention: A preferred ASO 
forward primer for detecting P2AR gene polymorphisms at PS2 and PS5 comprises a nucleotide 
sequence selected from the group consisting of: 

CGAGTGTGCATGTCG (SEQ ID NO:7); CTCCCAGCTCACCGA (SEQ ID NO:8); 

CGAGTGTGCATGTAG (SEQ ID NO:9); CTCCCAGCTCACCTA (SEQ ID NO: 10); 

AGCAGTGGTGGCCCG (SEQ ID NO: 1 1); CTCCCTGGAGGGCGG (SEQ ED NO: 12); 

AGCAGTGGTGGCCTG (SEQ ID NO:13) and CTCCCTGGAGGGCAG (SEQ ID NO: 14)! 

Other genotyping oligonucleotides of the invention.hybridize to a target region located 
one to several nucleotides downstream of one of the novel polymorphic sites identified herein. 
Such oligonucleotides are useful in polymerase-mediated primer extension methods for detecting 
P2AR gene polymorphisms and thus are referred to herein as "primer-extension 
oligonucleotides". In a preferred embodiment, the 3 '-terminus of a primer-extension 
oligonucleotide is a deoxynucleotide complementary to the nucleotide located- immediately 
adjacent to the polymorphic site. A particularly preferred oligonucleotide primer for detecting 
p 2 AR gene polymorphisms at PS2 and PS5 by primer extension terminates in. a nucleotide 
sequence selected from the group consisting of: 

GTGTGCATGT (SEQ ID NO: 15); CCAGCTCACC (SEQ ID NO:16); 

AGTGGTGGCC (SEQ ID NO:17); and CCTGGAGGGC (SEQ ID NO:18). 

In some embodiments, a composition contains two or more differently labeled genotyping 
oligonucleotides for simultaneously probing the identity of nucleotides at two or more 
polymorphic sites. It is also contemplated that primer compositions may contain two or more 
sets of allele-specific primer pairs to allow simultaneous targeting and amplification of two or 
more regions containing a polymorphic site., 
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As further described below, the inventors herein have discovered that a patient's 
bronchodilating response to albuterol may be predicted with high confidence by genotyping only 
three of the polymorphic sites in the (3 2 AR gene: PS3, PS9 and PS1 1. Thus, the invention also 
provides a diagnostic kit for predicting an individual's response to a P-agonist. In one 
embodiment, the kit comprises a set of genotyping oligonucleotides for genotyping PS3, PS9 and 
PS 1 1 in the P2AR gene packaged in a container. The kit may also contain other components such 
asvhybridization buffer, where the oligonucleotides are to be used as allele-specific probes, or 
dideoxynucleotide triphosphates (ddNTPs), where the polymorphic sites are to be detected by 
primer extension. In a preferred embodiment, the set of genotyping oligonucleotides consists of 
three primer extension .oligonucleotides, one for genotyping PS3, one for genotyping PS9 and 
one for genotyping PS 1 1. The kit may also contain a polymerase and a reaction buffer optimized 
for primer extension mediated by the polymerase. Preferred kits may also include detection 
reagents, such as biotin- or fluorescent-tagged oligonucleotides or ddNTPs and/or an enzyme- 
labeled antibody and one or more substrates that generate a detectable signal when acted on by 
the enzyme. In a preferred embodiment, each of the genotyping oligonucleotides and all other 
reagents in the kit have been quality tested for optimal performance in a genotyping assay for 
each of PS3, PS9 and PS1 1 and the kit also contains instructions for performing the assay and 
assigning a P2AR haplotype pair from the results. It will be understood by the skilled artisan that 
the set of genotyping oligonucleotides and reagents for performing the genotyping assay will be 
provided in separate receptacles placed in the container if appropriate to preserve biological or 
chemical activity and enable proper use in the assay. 

p 2 AR genotyping oligonucleotides of the invention. may also be immobilized on or 
synthesized on a solid surface such as a microchip, bead, or glass slide (see, e.g., WO 98/20020 
and WO 98/20019). Such immobilized genotyping oligonucleotides may be used in a variety of 
polymorphism detection assays, including but not limited to probe hybridization and polymerase 
extension assays. Immobilized' P2AR genotyping oligonucleotides of the invention may comprise 
an ordered array of oligonucleotides designed to rapidly screen a DNA sample for 
polymorphisms in multiple genes at the same time. 

The above described oligonucleotide compositions and kits are useful in methods for 
genotyping and/or haplotyping the P2AR gene in an individual. As used herein, the terms "P2AR 
genotype" and "P2AR haplotype" mean the genotype or haplotype contains the nucleotide pair or 
nucleotide, respectively, that is present at one or both of PS2 and PS5 and may optionally also 
include the nucleotide pair or nucleotide present at one or more additional polymorphic sites in 
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the P2AR gene. The additional polymorphic sites may be currently known polymorphic sites or 
sites that are subsequently discovered. In preferred embodiments, the additional polymorphic 
sites are selected from the group consisting of PS1, PS3, PS4, PS6, PS7, PS8, PS9, PS 10, PS1 1 , 
PS12andPS13. 

One embodiment of the genotyping method involves isolating from the individual a 
nucleic acid mixture comprising the two copies of the P2AR gene, or a fragment thereof, that are 
present in the individual, and determining the identity of the nucleotide pair at one or more of 
PS2 and PS5 in the two copies to assign a P2AR genotype to the individual. As will be readily 
understood by the skilled artisan, the two "copies" of a gene in an individual may be the same 
allele or may be different alleles. In a particularly preferred embodiment, the genotyping method 
comprises determining the identity of the nucleotide pair at each of PS 1, PS2, PS3, PS4, PS5, 
PS6, PS7, PS8, PS9, PS10, PS11, PS12, and PS13. 

Typically, the nucleic acid mixture is isolated from a biological sample taken from the 
individual, such as a blood sample or tissue sample. Suitable tissue samples include whole 
blood, semen saliva, tears, urine, fecal material, sweat, buccal, skin and hair. The nucleic acid 
mixture may be comprised of genomic DNA, mRNA, or cDNA and, in the latter two cases, the 
biological sample must be obtained from an organ in which the P2AR gene is expressed. 
Furthermore it will be understood by the skilled artisan that mRNA or cDNA preparations would 
hot be used to detect polymorphisms located in introns or in 5' and 3' ndntranscribed regions. If 
a P2AR gene fragment is isolated, it must contain the polymorphic site(s) to be genotyped. 

One embodiment of the haplotyping method comprises isolating from the individual a 
nucleic acid molecule containing only one of the two copies of the P2AR gene, or a fragment 
thereof, that is present in the individual and determining in that copy the identity of the 
nucleotide at one or both of polymorphic sites PS2 and PS5 in that copy to assign a P2AR 
haplotype to the individual. The nucleic acid may be isolated using any method capable of 
separating the two copies of the P2 AR gene or fragment such as one of the methods described 
above for preparing P2AR isogenes, with targeted in vivo cloning being the preferred approach. 
As will be readily appreciated by those skilled in the art, any individual clone will only provide 
haplotype information on one of the two P2AR gene copies present in an individual. If haplotype 
information is desired for the individual's other copy, additional P2AR clones will need to be 
examined. Typically, at least five clones should be examined to have more than a 90% 
probability of haplotyping both copies of the P2AR gene in an individual. In a particularly 
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preferred embodiment, the nucleotide at each of PS1, PS2, PS3, PS4, PS5, PS6, PS7, PS8, PS9, 
PS 10, PS 11, PS 12 and PS 13 is identified. 

In a preferred embodiment, a P2AR haplotype pair is determined for an individual by 
identifying the phased sequence of nucleotides at one or both of PS2 and PS 5 in each copy of the 
P2AR that is gene present in the individual. In a particularly preferred embodiment, the 
haplo typing method comprises identifying the phased sequence of nucleotides at each of PS1, 
PS2, PS3, PS4, PS5, PS6, PS7, PS8, PS9, PS10, PS1 1, PS12 and PS13 in each copy of the (3 2 AR 
gene. When haplotyping both copies of the gene, the identifying step is preferably performed 
with each copy of the gene being placed in separate containers. However, it is also envisioned 
that if the two copies are labeled with different tags, or are otherwise separately distinguishable 
or identifiable, it could be possible in some cases to perform the method in the same container. 
For example, if first and second copies of the gene are labeled with different first and second 
fluorescent dyes, respectively, and an genotyping oligonucleotide labeled with yet a third 
different fluorescent dye is used to assay the polymorphic site(s), then detecting a combination of 
the first and third dyes would identify the polymorphism in the first gene copy while detecting a 
combination of the second and third dyes would identify the polymorphism in the second gene 
copy. 

Another aspect of the invention is a method for predicting an individual's bronchodilating 
response to a p-agonist, which comprises assigning a P2AR haplotype pair to the individual and 
using the assigned haplotype pair to make a response prediction selected from the group 
consisting of: assignment of P2AR haplotype pair 4/6 or 2/2 predicts a good bronchodilating 

response; assignment of P2AR haplotype pair 2/6 predicts an intermediate bronchodilating 
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response and assignment of p 2 AR haplotype pair 2/4 or 4/4 predicts no bronchodilating response. 
These haploytpes pairs are set forth in Table 6. In a preferred embodiment, the P-agonist is 
albuterol. 

In one embodiment, the assigning step comprises determining a genotype for PS3, PS9 
and PS1 1 in the individual's P2AR gene and using the genotype to assign the haplotype pair. The 
genotype and haplotype pair combinations that are predictive of response to P-agonists are shown 
in Table 1 below. 



22 



Table 1 . Haplotype pairs predicted from genotypes. 


Genotype for PS3, PS9 and PS11 


Haplotype Pair 


G/G, G/G, G/G 


2/2 


A/A, A/A, A/A 


4/4 


A/G, A/G, A/G 


4/6 


G/G, G/G, G/A 


2/6 


A/G, A/G, G/G 


2/4 



Additional genotype and haplotype pair combinations that exist in the population are shown in 
Table 4 below. The ability to assign the P2AR haplotype pair by genotyping only PS3, PS9 and 
PS 1 1 is based on linkage disequilibrium between certain groups of polymorphic sites in the (3 2 AR 
gene (Fig. 4) as well as the frequencies of the individual P2AR haplotypes in the different 
population groups (Table 5). Thus, where an individual is heterozygous at two or all three of 
PS3, PS9 and PS1 1, e.g., more than one possible haplotype pair is consistent with the 
individual's genotype for PS3, PS9 and PS1 1, the probability of assigning the haplotype pair 
correctly is readily calculated from the frequencies that the individual haplotypes occur in that 
individual's population group. Based on the sample size in the study described below, it is 
believed that an individual's haplotype pair would be accurately predicted greater than 90% of 
the time. In a preferred embodiment, an individual's P2AR haplotype pair is assigned by 
genotyping one or more additional polymorphic sites selected from the group consisting of PS1, 
PS2, PS4, PS5, PS6, PS7, PS8, PS9, PS10, PS12 and PS13. 

The ability to predict a patient's response to a p-agonist is useful for physicians in making 
decisions about how to treat an asthma patient for bronchospasm. An asthma patient whose 
haplotype pair indicates the patient will probably respond well to the agonist would be a better 
candidate for p-agonist therapy than a patient who is likely to exhibit an intermediate response or 
no response, and the physician would be able to determine with less trial and error which . 
individuals should receive an alternative form of therapy. 

The following pairs of P2AR sites exhibit about .100% linkage disequilibrium in 
Caucasians: PS1 and PS 10 (amino acid 27); PS3 and PS9 (amino acid 16); PS4 and PS 10; and 
PS6 and PS 10. This provides the basis for another aspect of the invention, which is a method for 
predicting a Caucasian individual's genotype for one or both of PS9 and PS10, both of which 
have been linked to various diseases and conditions; The method comprises determining a first 
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genotype for one or more of PS1, PS3, PS4 or PS6 in the individual's P2AR gene and using the 
first genotype to assign a second genotype for one or both of PS9 and PS 10 to the individual. 
Preferably, the first and second genotypes correspond to one of the genotype combinations 
shown in Table 2 below: 



Table 2: Genotype Combinations 


PolvmnmViif* SIitf*<; 




PS1 PS 10 


a /a rvo 


P<31 P910 

X O 1 , XulU 


rifci p/p 


P91 P<HO 


a /p. p/p 

J\f\J 9 VJ/ V_y 


poo pen 


A/A A/A 


pea PQQ 


p/p ri/p 


PS1 P9Q 


• A/n A /p 


pQ4 PS 10 


rvp p/p 


PC4. PS in 


n/p nvp 


PS4 PS 10 


p/o p/o 


pCfi psm 


p/p p/p 


PS6 PS10 

X L_> V/ } X U X.XJ 


T/T P/P 


PS6, PS 10 


C/T, G/C 


PS1,PS3,PS9,PS10 


G/G, A/A, A/A. C/C 


PS1, PS3, PS9, PS10 


A/G, G/G, G/G, G/C 


PS1.PS3, PS9.PS10 


A/A, G/G, G/G, G/G 


PS1.PS3. PS9, PS 10 


G/G, A/G, A/G, C/C 


PS1.PS3, PS9, PS10 


A/G, A/G, A/G, G/C 


PS3.PS4.PS9.PS 10 


A/A, C/C, A/A, C/C 


PS3, PS4, PS9, PS10 


G/G, G/G, G/G, G/G 


PS3, PS4, PS9, PS10 


A/G, C/C, A/G. C/C 


PS3.PS6.PS9.PS 10 


A/A, T/T, A/A, C/C 


PS3.PS6, PS9. PS10 


G/G, C/C, G/G, G/G 


ps3,ps6,ps9, psio 


A/G, T/T, A/G, C/C 


PS3.PS6, PS9.PS10 


G/G, C/T, G/G, G/C 


PS3.PS6.PS9.PS10 


A/G, C/T, A/G, G/C 



In a preferred. embodiment, the first genotype is for PS3 and one or more of PS1, PS4 and PS6, 
and the second genotype is for both PS9 and PS 1 0. 

This genotyping prediction method is useful for predicting the individual's predisposition 
to a disease or condition associated with one of the alternative alleles at PS9 or PS10 in the P2AR 
gene. For example, a PS9 genotype of G/G means the individual expresses only the P2AR Glyl6 
variant and is predisposed to atopy and nocturnal asthma, whereas a PS9 genotype of A/A . 
indicates the individual expresses only the P2AR Argl6 variant and is predisposed to MG. 
Similarly, a PS10 genotype of C/C means the individual expresses only the (5 2 AR Gln27 variant 
and is predisposed to childhood asthma, while a PSIO genotype means the individual expresses 
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only the P2AR Glu27 variant and is predisposed to obesity. Where the first genotype is for PS3 
and one or more of PS1, PS4, and PS6, the genotypes at PS9 and PS 10 can be predicted, and 
such genotypes are useful for predicting severity of asthma (G/G at PS9, G/G at PS 10) and BHR. 
However, the present invention is not limited to the P2AR polymorphism or haplotype 
associations presently known but is applicable to future discoveries of associations between 
polymorphisms or haplotype for PS9 and PS 10 and disease, severity of disease, staging of 
disease, or any other phenotype. 

It is also contemplated that the above genotyping and haplotyping methods of the 
invention may be performed in combination with identifying the genotype(s) and/or haplotype(s) 
for other genomic regions. 

In the genotyping and haplotyping methods of the invention, the identity of a nucleotide 
(or nucleotide pair) at a polymorphic site may be determined by amplifying a target region(s) 
containing the polymorphic site(s) directly from one or both copies of the P2AR gene present in 
the individual and the sequence of the amplified region(s) determined by conventional methods. 
It will be readily appreciated by the skilled artisan that only one nucleotide will be detected at a 
polymorphic site in individuals who are homozygous at that site, while two different nucleotides 
will be detected if the individual is heterozygous for that site. The polymorphism may be 
identified directly, known as positive-type identification, or by inference, referred to as negative- 
type identification. For example, where a SNP is known to be guanine and cytosine in a 
reference population, a site may be positively determined to be either guanine or cytosine for an 
individual homozygous at that site, or both guanine and cytosine, if the individual is 
heterozygous at that site. Alternatively, the site may be negatively determined to be not guanine 
(and thus cytosine/cytosine) or not cytosine (and thus guanine/guanine). 

. The target region(s) may be amplified using any oligonucleo tide-directed amplification 
method, including but not limited to polymerase chain reaction (PCR) (U.S. Patent No. 
4,965,188), ligase chain reaction (LCR) (Barany et al., Proc. Natl Acad. ScL USA 88:189-193, 
1991; WO90/01069), and oligonucleotide ligation assay (OLA) (Landegren et al., Science 
241: 1077-1 080, 1988). Oligonucleotides useful as primers or probes in such methods should 
specifically hybridize to a region of the nucleic acid that contains or is adjacent to the 
polymorphic site. Typically, the oligonucleotides are between 10 and 35 nucleotides in length 
and preferably, between 15 and 30 nucleotides in length. Most preferably, the oligonucleotides 
are 20 to 25 nucleotides long. The exact length of the oligonucleotide will depend on many 
factors that are routinely considered and practiced by the skilled artisan. 
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Other known nucleic acid amplification procedures may be used to amplify the target 
region including transcription-based amplification systems (U.S. Patent No. 5,130,238; EP 
329,822; U.S. Patent No. 5,169,766, WO89/06700) and isothermal methods (Walker et L., Proc. 
Natl Acad. Set USA 89:392-396/1992. 

5 A polymorphism in the target region may also be assayed before or after amplification 

using one of several hybridization-based methods known in the art. Typically, allele-specific 
oligonucleotides are utilized in performing such methods. The allele-specific oligonucleotides 
may be used as differently labeled probe pairs, with one member of the pair showing a perfect 
match to one variant of a target sequence and the other member showing a perfect match to a 

10 different variant. In some embodiments, more than one polymorphic site may be detected at 
once using a set of allele-specific oligonucleotides or oligonucleotide pairs. Preferably, the 
members of the set have melting temperatures within 5°C, and more preferably within 2°C 3 of 
each other when hybridizing to each of the polymorphic sites being detected. 

Hybridization of an allele-specific oligonucleotide to a target polynucleotide may be 

15 performed with both entities in solution, or such hybridization may be performed when either the 
oligonucleotide or the target polynucleotide is covalently or noncovalently affixed to a solid 
support. Attachment may be mediated, for example, by antibody-antigen interactions, poly-L- 
Lys, streptavidin or avidin-biotin, salt bridges, hydrophobic interactions, chemical linkages, UV 
cross-linking baking, etc. Allele-specific oligonucleotides may be synthesized directly on the 

20 solid support or attached to the solid support subsequent to synthesis. Solid-supports suitable for 
use in detection methods of the invention include substrates made of silicon, glass, plastic, paper 
and the like, which may be formed, for example, into wells (as in 96-well plates), slides, sheets, 
membranes, fibers, chips, dishes, and beads. The solid support may be treated, coated or . 
derivatized to facilitate the immobilization of the allele-specific oligonucleotide or target nucleic 

25 acid. 

The genotype or haplotype for one or more polymorphic sites in the P2AR gene of an 
individual may also be determined by hybridization of one or both copies of the gene, or a 
fragment thereof, to nucleic acid arrays and subarrays such as described in WO 95/1 1995. The 
arrays would contain a battery of allele-specific oligonucleotides representing each of the 
30 polymorphic sites to be included in the genotype or haplotype. 

The identity of polymorphisms may also be determined using a mismatch detection 
technique, including but not limited to the RNase protection method using riboprobes (Winter et 
al., Proa Natl Acad. Set USA 82:7575, 1985; Meyers et al., Science 230:1242, 1985) and 



proteins which recognize nucleotide mismatches, such as the E. coli rnutS protein (Modrich, P. 
Ann. Rev. Genet 25:229-253 (1991). Alternatively, variant alleles can be identified by single 
strand conformation polymorphism (SSCP) analysis (Orita et aL, Genomics 5:874-879, 1989; 
Humphries et aL, in Molecular Diagnosis of Genetic Diseases, R. Elles, ed., pp 321-340, 1996) 
or denaturing gradient gel electrophoresis (DGGE) (Wartell et aL, Nucl Acids Res. 18:2699- 
2706, 1990; Sheffield et aL, Proc. Natl. Acad. ScL USA 86:232-236, 1989). 

A polymerase-mediated primer extension method may also be used to identify the 
polymorphism(s). Several such methods have been described in the patent and scientific 
literature and include the "Genetic Bit Analysis" method (W092/1 5712) and the 
ligase/polymerase mediated genetic bit analysis (U.S. Patent 5,679,524. Related methods are 
disclosed in WO91/02087, WO90/09455, W095/17676, andU.S: Patent Nos. 5,302,509 and 
5,945,283. Extended primers containing a polymorphism may be detected by mass spectrometry 
as described in U.S. Patent No. 5,605,798. An other primer extension method is allele-specific 
PCR (Ruano et aL, Nucl. Acids Res. 17:8392, 1989; Ruano et aL, Nucl Acids Res. 19, 6877-6882, 
1991; WO 93/22456; Turki et aL, Clin. Invest. 95:1635-1641, 1995). In addition, multiple 
polymorphic sites may be investigated by simultaneously amplifying multiple regions of the 
nucleic acid using sets of allele-specific primers as described in Wallace et al. (WO89/10414). 

The above described genotyping methods are useful in methods for determining the 
frequency of a P2AR genotype or haplotype in a population. The method comprises determining 
the genotype or the haplotype pair for the P2AR gene that is present in each member of the 
population and calculating the frequency any particular P2AR genotype or haplotype is found in 
the population. In a preferred embodiment, the P2AR genotype comprises the nucleotide pair(s) 
detected at each of PS 1 -PS 1 3 . The population may be a reference population, a family 
population, a same sex population, a population group, a trait population (e.g., a group of 
individuals exhibiting a trait of interest such as a medical condition or response to a therapeutic 
treatment). 

Frequency data for such P2AR genotypes or haplotypes in reference and trait populations 
are useful for identifying an association between a trait and any novel P2AR polymorphism, 
genotype or haplotype. The trait may be any detectable phenotype, including but not limited to 
susceptibility to a disease or response to a treatment. The method involves obtaining data on the 
frequency of the genotype(s) or haplotype(s) of interest in a reference population as well as in a 
population exhibiting the trait. Frequency data for one or both of the reference and trait 
populations may be obtained by genotyping or haplotyping the P2AR gene in each individual in 
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the populations using one of the methods described above. The haplotypes for the trait 
population may be determined directly or, alternatively, by the predictive genotype to haplotype 
approach described above. In another embodiment, the frequency data for the reference and/or 
trait populations is obtained by accessing previously determined frequency data, which may be in 
written or electronic form. For example, the frequency data may be present in a database that is 
accessible by a computer. Once the frequency data is obtained, the frequencies of the 
genotype(s) or haplotype(s) of interest in the reference and trait populations are compared. In a 
preferred embodiment, the frequencies of all genotypes and/or haplotypes observed in the 
populations are compared. If a particular genotype or haplotype for the P2AR gene is more 
frequent in the trait population than in the reference population at a statistically significant 
amount, then the trait is predicted to be associated with that P2AR genotype or haplotype. 
Preferably, the (3 2 AR genotype or haplotype being compared in the trait and reference 
populations is selected from the full-genotypes and full-hap lotypes shown in Tables 4 and 5 
respectively, or from sub-genotypes and sub-haplotypes derived from these genotypes and 
haplotypes. 

In a preferred embodiment of the method, the trait of interest is a clinical response 
exhibited by a patient to some therapeutic treatment, for example, response to a drug targeting 
P2AR or response to a therapeutic treatment for a medical condition. As used herein, "medical 
condition" includes but is not limited to any condition or disease manifested as one or more 
physical and/or psychological symptoms for which treatment is desirable, and includes 
previously and newly identified diseases and other disorders. As used herein the term "clinical 
response" means any or all of the following: a quantitative measure of the efficacy of the therapy, 
no efBcacy, and adverse events (i.e., side effects). 

In order to deduce a correlation between clinical response to a treatment and a P2AR 
genotype or haplotype, it is necessary to obtain, data on the clinical responses exhibited by a 
population of individuals who received the treatment, hereinafter the "clinical population". This 
clinical data may be obtained by analyzing the results of a clinical trial that has already been run 
and/or the clinical data may be obtained by designing and carrying out one or more new clinical 
trials. As used herein, the term "clinical trial" means any research study designed to collect 
clinical data on responses to a particular treatment, and includes but is not limited to phase I, 
phase II and phase IH clinical trials. Standard methods are used to define the patient population 
and to enroll subjects. 
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It is preferred that selection of individuals for the clinical population comprises grading 
such candidate individuals for the existence of the medical condition of interest and then 
including or excluding individuals based upon the results of this assessment. This is important in 
cases where the symptom(s) being presented by the patients can be caused by more than one 
underlying condition, and where treatment of the underlying conditions are not the same. An 
example of this would be where patients experience breathing difficulties that are due to either 
asthma or respiratory infections. If both sets were treated with an asthma medication, there 
would be a spurious group of apparent non-responders that did not actually have asthma. These 
people would affect the ability to detect any correlation between haplotype and treatment 
outcome. This grading of potential patients could employ a standard physical exam or one or 
more lab tests. Alternatively, grading of patients could use haplo typing for situations where there 
is a strong correlation between haplotype pair and disease susceptibility or severity. 

The therapeutic treatment of interest, or the control treatment (active agent or placebo in 
controlled trials),. is administered to each individual in the trial population and each individual's 
response to the treatment is measured using one or more predetermined criteria. It is 
contemplated that in many cases, the trial population will exhibit a range of responses and that 
the investigator will choose the number of responder groups (e.g., low, medium, high) made up 
by the various responses. In addition, the P2AR gene for each individual in the trial population is 
genotyped and/or haplotyped, which may be done before or after administering the treatment. 

After both the clinical and polymorphism data have been obtained, correlations between 
individual response and P2AR genotype or haplotype content are created. Correlations may be 
produced in several ways. In one method, individuals are grouped by their P2AR genotype or 
haplotype (or haplotype pair) (also referred to as a polymorphism group), and then the averages 
and standard deviations of continuous clinical responses exhibited by the members of each 
polymorphism group are calculated. 

These results are then analyzed to determine if any observed variation in clinical response 
between polymorphism groups is statistically significant. Statistical analysis methods which may 
be used are described in L.D. Fisher and G. vanBelle, "Biostatistics: A Methodology for the 
Health Sciences", Wiley-Interscience (New York) 1993. This analysis may also include a 
regression calculation of which polymorphic sites in the P2AR gene give the most significant 
contribution to the differences in phenotype. One regression model useful in the invention starts 
with a model of the form 

r = r 0 +Sxflf 
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where r is the response, r Q is a constant called the "intercept", S is the slope and d is the dose. 

To determine the dose, the most-common and least common nucleotides at the polymorphic site 
are first defined. Then, for each individual in the trial population, one calculates a "dose" as the 
number of least-common nucleotides the individual has at the polymorphic site of interest. This 
value can be 0 (homozygous for the least-common nucleotide), 1 (heterozygous), or 2 
(homozygous for the most common nucleotide). An individual's "response" is the value of the 
clinical measurement. Standard linear regression methods are then used to fit all the individuals' 
doses and responses to a single model (see e.g., L.D. Fisher and G. vanBelle, supra, Ch 9). The 
outputs of the regression calculation are the intercept r 0 , the slope S, and the variance (which 
measures how well the data fits this simple linear model). The Students t-test value and the level 
of significance can then be calculated for each of the polymorphic sites. 

A second method for finding correlations between P2AR haplotype content and clinical, 
responses uses predictive models based on error-minimizing optimization algorithms. One of 
many possible optimization algorithms is a genetic algorithm (R. Judson, "Genetic Algorithms 
and Their Uses in Chemistry" in Reviews in Computational Chemistry, Vol. 10, pp. 1-73, K. B. 
Lipkowitz and D. B. Boyd, eds. (VCH Publishers, New York, 1997). Simulated annealing (Press 
et al., "Numerical Recipes in C: The Art of Scientific Computing", Cambridge University Press 
(Cambridge) 1992, Ch. 10), neural networks (E. Rich and K. Knight, "Artificial Intelligence", 2 nd 
Edition (McGraw-Hill, New York, 1991, Ch. 18), standard gradient descent methods (Press et al., 
supra Ch. 10), or other global or local optimization approaches (see discussion in Judson, supra) 
could also be used. As an example, a genetic algorithm approach is described herein. This 
method searches for optimal parameters or weights in linear or non-linear models connecting 
P2AR haplotype loci and clinical outcome. One model is of the form 

where C is the measured clinical outcome, i goes over all polymorphic sites, a over all candidate 
genes, C 0 , w i a and W iM are variable weight values, R i a is equal to 1 if site i in gene a in the 

first haplotype takes on the most common nucleotide and -1 if it takes on the less common 
nucleotide. L i a is the same as R i a except for the second haplotype. The constant term C 0 and 

the weights w iA and W ifiL are varied by the genetic algorithm during a search process that 

minimizes the error between the measured value of C and the value calculated from Equation 1. 
Models other than the one given in Equation 1 can be readily incorporated by those skilled in the 
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art for analyzing the clinical and polymorphism data. The genetic algorithm is especially suited 
for searching not only over the space of weights in a particular model but also over the space of 
possible models (Judson, supra). 

Correlations may also be analyzed using analysis of variation (ANOVA) techniques to 
determine how much of the variation in the clinical data is explained by different subsets of the 
polymorphic sites in the P2AR gene. ANOVA is used to test hypotheses about whether a 
response variable is caused by or correlated with one or more traits or variables (in this case, 
polymorphism groups) that can be measured (Fisher and vanBelle, supra, Ch. 10). These traits or 
variables are called the independent variables. To carry out ANOVA, the independent 
variable(s) are measured and individuals are placed into groups based on their values for these 
variables.. In this case, the independent variable(s) refers to the combination of polymorphisms 
present at a subset of the polymorphic sites, and thus, each group contains those individuals with 
a given genotype or haplotype pair. The variation in response within the groups and also the 
variation between groups is then measured. If the within-group response variation is large 
(people in a group have a wide range of responses) and the response variation between groups is 
small (the average responses for all groups are about the same) then it can be concluded that the 
independent variables used for the grouping are not causing or correlated with the response 
variable. For instance, if people are grouped by month of birth (which should have nothing to do 
with their response to a drug) the ANOVA calculation should show a low level of significance. 
However, if the response variation is larger between groups than within groups, the F-ratio 
(— ^between groups" divided by "within groups") is greater than one. Large values of the F-ratio 
indicate that the independent variable is causing or correlated with the response. The calculated 
F-ratio is preferably compared with the critical F-distribution value at whatever level of 
significance is of interest. If the F-ratio is greater than the Critical F-distribution value, then one 
may be confident that the individual's genotype or haplotype pair for this particular subset of 
polymorphic sites in the P2AR gene is at least partially responsible for, or is at least strongly 
correlated with the clinical response. 

From the analyses described above, a mathematical model may be readily constructed by 
the skilled artisan that predicts clinical response as a function of P2AR genotype or haplotype 
content. Preferably, the model is validated in one or more follow-up clinical trials designed to 
test the model. 

The identification of an association between a clinical response and a genotype or 
haplotype (or haplotype pair) for the P2AR gene may be the basis for designing a diagnostic 
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method to determine those individuals who will or will not respond to the treatment, or 
alternatively, will respond at a lower level and thus may require more treatment, i.e., a greater 
dose of a drug. The diagnostic method may take one of several forms: for example, a direct 
DNA test (i.e., genotyping or haplotyping one or more.of the polymorphic sites in the P2AR 
gene), a serological test, or a physical exam measurement. The only requirement is that there be 
a good correlation between the diagnostic test results and the underlying P2AR genotype or 
haplotype that is in turn correlated with the clinical response. In a preferred embodiment, this 
diagnostic method uses the predictive haplotyping method described above. 

Any or all analytical and mathematical operations involved in practicing the methods of 
the present invention may be implemented by a computer. In addition, the computer may 
execute a program that generates views (or screens) displayed on a display device and with 
which the user can interact to view and analyze large amounts of information relating to the 
P2AR gene and its genomic variation, including chromosome location, gene structure, and gene 
family, gene expression data, polymorphism data, genetic sequence data, and clinical data 
population data (e.g., data on ethno geographic origin, clinical responses, genotypes, and 
haplotypes for one or more populations). The P2AR polymorphism data described herein may be 
stored as part of a relational database (e.g., an instance of an Oracle database or a set of ASCII 
flat files). These polymorphism data may be stored on the computer's hard drive or may, for 
example, be stored on a CD ROM or on one or more other storage devices accessible by the 
computer. For example, the data may be stored on one or more databases in communication with 
the computer via a network. 

Preferred embodiments of the invention are described in the following examples. Other 
embodiments within the scope of the claims herein will be apparent to one skilled in the art from 
consideration of the specification or practice of the invention as disclosed herein. It is intended . 
that the specification, together with the examples, be considered exemplary only, with the scope 
and spirit-of the invention being indicated by the claims which follow the examples. 

Examples 

The Examples herein are meant to exemplify the various aspects of carrying out the 
invention and are not intended to limit the scope of the invention in any way. The Examples do 
not include detailed descriptions for conventional methods employed, such as in the synthesis of 
oligonucleotides or preparation of antibodies. Such methods are well known to those skilled in 
the art and are described in numerous publication's, for example, Sambrook, Fritsch, and 
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Maniatis, Molecular Cloning: a Laboratory Manual; 2 nd Edition, Cold Spring Harbor Laboratory 
Press, USA, (1989). 

Example 1 

This example illustrates examination of the P2AR gene for polymorphic sites from about 
5 1 100 base pairs upstream of the ATG start site to about 700 base pairs downstream of the ATG 
start site. 

Amplification of Target Region 

Overlapping fragments of the p 2 AR gene were amplified from genomic DNA from the 

Index Repository and the asthma patient cohort, using the following PCR primers, with the 

10 indicated positions corresponding to GenBank Accession No. Ml 5 1 69: 

Fragment 1 

<iS Forward Primer: nt 495-517 

&3 Reverse Primer: complement of nt 1735-1708 

J - * 1241 nt product (495-1735) 

ill! 15 Fragment 2 

m 

m 



Forward Primer: 1671-1695 
Reverse Primer: complement of 2857-2831 
1187 nt product (1671-2831) 



20 The resulting PCR products were sequenced using dye terminator chemistry (Big-Dye, PE 



Biosystems) and an ABI 3700 capillary sequencer. The sequencing primers were designed to 
provide for overlapping -500 bp reads. 
. Analysis of Sequences for Polymorphic Sites 

Sequences were analyzed for the presence of polymorphisms using the Polyphred 
25 program (Nickerson et al., Nucleic Acids Res. 14:2745-2751, 1997). The presence of a 

polymorphism was confirmed on both strands. The polymorphisms and their locations in the 
P2AR gene are listed in Table 3 below. 
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Table 3. Polymorphisms Identified in the P2AR Gene 


Polymorphic 
Site 8 


Nucleotide Position 


Reference 
Allele 


Variant 
Allele 


Fieure 1 


Relative to CDS" 


PSl 


565 


-1023 


G 


A 


PS2 


879 


-709 


c 


A 


PS3 


934 


-654 


. G 


A 


PS4 


1120 


-468 


G 


c 


PS5 


1182 


-406 


C 


T 


PS6 


1221 


-367 


C 


T 


PS7 


1541 


-47 


C 


T 


PS 8 


1568 


-20 


T 


C 


PS9 


1633 


46 


A(Arg) 


G (Gly) 


PSIO 


1666 


79 


C(Gln) 


G (Glu) 


PSll 


1839 


252 


G 


A 


PS12 


2078 


491 


C 


T 


PS13 


2110 


523 


C 


A 



3 All polymorphic sites other than PS2 and PS5 have been previously reported. 
b CDS means coding sequence 



Example 2 

This example illustrates analysis of the P2AR gene polymorphisms identified in the Index 
Repository and asthmatic cohort for genotypes and haplotypes. 

The different genotypes containing these polymorphisms that were observed in the 
reference and asthma populations are shown in Table 4 below, /with the haplotype pair indicating 
the combination of haplotypes determined for each individual using the haplotype derivation 
protocol described below. 
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Table 4. Genotypes and Haplotype Pairs Observed for the P2-AR gene 



Genotype 


Polymorphic Site* 


HAP 


No. 


PSl 


PS2 


PS3 


PS4 


PS5 


PS6 


PS7 


PS8 


PS9 


PS10 


PS11 


PS12 


PS13 


Pair 


1 


A 


C 


G 


c 


c 


T 


T 


T 


A 


C 


G 


C 


c . 


1/1 


2 


A 


C 


G. 


G/C 


c 


cyr 


cyr 


cyr 


A/G 


G/C 


G 


C 


c 


1/2 


3 


A/G 


C 


A/G 


c 


c 


T 


T 


T 


A 


C 


G 


C 


c 


1/4 


4 


A/G 


C 


G 


c 


c 


T 


T 


. T 


A/G 


c 


G/A 


C 


A/C 


1/6 


5 


A 


C 


G 


c 


T/C 


T 


T 


T 


A 


c 


G 


C 


c 


1/9 


6 


A 


C 


G 


G 


C 


C 


C 


C 


G 


G 


G 


C 


c 


2/2 


7 


A/G 


A/C 


A/G 


G/C 


C 


C/T 


cn 


cn* 


A/G 


G/C 


G 


C 


c 


2/3 


8 


A/G 


C 


A/G 


G/C 


C 


cn 


cn 


cyr 


A/G 


G/C 


G 


C 


c 


2/4 


9 


A/G 


C 


A/G 


G/C 


C 


cn 


cn 


or 


G 


G/C 


G 


C 


c 


2/5 


10 


A/G 


c 


G 


G/C 


C 


cn 


cn 


cyr 


G 


G/C 


G/A 


C 


A/C 


2/6 


11 


A/G 


c 


G 


G/C 


c 


cn 


cn 


cn 


G 


G/C 


G/A 


cyr 


A/C 


2/7 


12 


A/G 


c 


A/G 


G/C 


c 


cn ■ 


cn 


cn 


A/G 


G/C 


G/A 


c 


A/C 


2/8 


13 


A/G 


c 


G 


G/C 


c 


cn 


cn 


cn 


G 


G/C 


G/A 


c 


c 


2/10 


14 


A/G 


c 


G 


G/C 


c 


cn 


cn 


cn 


G 


G/C 


G 


c 


c 


2/11 


15 


G 


A/C 


A 


C 


c 


T 


T 


T 


A 


C . 


G 


c 


c 


3/4 


16 


G 


C 


A 


c 


c 


T 


T 


T 


A 


c 


. G 


c 


c 


4/4 


17 


G 


c 


A 


C 


c 


T 


T 


T 


A/G 


c 


G 


c 


c 


4/5 


18 


G 


c 


A/G 


C 


c 


T 


T 


T 


A/G 


c 


G/A 


c 


A/C 


4/6 


19 


G 


c 


A/G 


C 


c 


T 


T 


• T 


A/G 


c 


G/A 


cyr 


A/C 


4/7 


20 


G 


c 


A 


c 


c 


T 


T 


T 


A 


c 


G/A 


c 


A/C 


4/8 


21 


A/G 


c 


A/G 


C 


T/C 


T 


T 


T 


A ' 


c 


G 


c 


c 


4/9 


22 


G 


c 


A/G 


c 


c 


T 


T 


T 


A/G 


c 


G/A 


c 


c 


4/10 


23 


A/G 


c 


A 


G/C 


c 


T 


cyr 


T 


A- 


c 


G 


c 


c 


4/12 


24 


G 


c 


G 


c 


c 


T 


T 


T 


G 


c 


A 


c 


A 


6/6 


25 


G 


c 


G 


c 


c 


T 


T 


T 


G 


c 


A 


cyr 


A 


6/7 


26 


A/G 


c 


G 


c 


T/C 


T 


T 


T 


A/G 


c 


G/A 


c 


A/C 


6/9 


27 


G 


c 


G 


c 


C 


T 


T 


T . 


G 


c 


G/A 


c 


A/C 


1 6/11 



*Homozygous positions are indicated by one nucleotide; heterozygous positions are indicated by 



two nucleotides. 

The haplotype pairs shown in the Table 4 were estimated from the unphased genotypes 
using an extension of Clark's algorithm (Clark, A.G, (1990) Mol Bio Evol 7, 1 1 1-122), in which 
haplotypes are assigned directly from individuals who are homozygous at all sites or 
heterozygous at no more than one of the variable sites. This list of haplotypes is then used to 
deconvolute the unphased genotypes in the remaining (multiply heterozygous) individuals. In 
our analysis the list of haplotypes was augmented with haplotypes obtained from three families 
(two multi-generation Caucasian families, one two-generation African-American family). 

Following this derivation protocol, the individual haplotypes and their frequencies in the 
different population groups in the Index Repository and asthmatic cohort were determined and 
are set forth in Table 5 below. 
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Table 5. (3 2 AR haplotypes and haplotype frequencies 



u An 

MAr 

M n 


Polymorphic Site 


Frequency (%)* 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


CA 


AA 


AS 


HL 


A 

1 


A 


C 


G 


C 


C 


T 


T 


T 


A 


C 


G 


C 


C 


0.7 


25.0 


12.5 


10.0 


0 


A 


C 


G 


G 


C 


C 


C 


C 


G 


G 


G 


C 


C 


48.3 


6.3 


10.0 


26.7 


0 

0 


G 


A 


A 


C 


C 


T 


T 


T 


A 


C 


G 


C 


C 


0.7 


0.0 


0.0 


0.0 


A 


G 


C 


A 






1 


TP 

T 


T 


A 


C 


G 


C 


C 


33.0 


29.7 


45.0 


40.0 


e 


G 


c 


A 


c 


c 


T 


T 


T 


G 


C 


G 


C 


C 


1.4 


0.0 


0.0 


0.0 


6 


G 


c 


G 


c 


c 


T 


T ■ 


T 


G 


C 


A 


C 


A 


13.2 


31.3 


30.0 


13.3 


7 


G 


c 


G 


c 


c 


T 


T 


T 


G 


C 


A 


T 


A 


1.0 


1.6 


0.0 


3.3 


8 


G 


C 


A 


c 


c 


T 


T 


T 


A 


C 


A 


C 


A 


0.7 


0.0 


0.0 


0.0 


9 


A 


C 


G 


c 


T 


T 


T 


T 


A 


C 


G 


C 


C 


0.0 


4.7 


0.0 


0.0 


10 


G 


C 


G 


c 


c 


T. 


T 


T 


G 


C 


A 


c 


C 


0.7 


0.0 


0.0 


3.3 


11 


G 


c 


G 


c 


c 


T 


T 


T 


G 


C 


G 


c 


C 


0.3 


0.0 


2.5 


0.0 


12 


A 


c 


G 


G 


c 


T 


T 


T 


A 


C 


G 


c 


C 


0.0 


1.6 


0.0 


3,3 






























100 


100 


100 


100 



* CA =Caucasian; AA=African- American; AS=Asian; HL=Hispanic-Latinos 



The inventors discovered that 13 variable sites exist in the P2AR, all within a span of 1 .6 
kb (Figure 1; Table 3). Two SNPs, at -709 (PS2) and -406 (PS5), have not been previously 
reported. Of the 2 13 (=8,192) possible combinations of these SNPs, only 12 were found in 
individuals from the index repository and the asthmatic cohort (Table 5). All SNPs and 
haplotypes were found to be in conformance with Hardy- Weinberg equilibrium, with the 
exception of homozygbtes for haplotype 1 in Hispanic Latinos, due to the existence of a single 
homozygote in this population for an otherwise rare haplotype. Four of the observed haplotypes 
occur in all populations sampled, although at markedly different frequencies. Haplotype 2, the 
most frequent in Caucasians (48%) is only seen at frequencies of 6%, "10%, and 27% in samples 
of African- Americans, Asians, and Hispanic Latinos, respectively. Furthermore, this particular 
haplotype is by far the most distinctive at the nucleotide level, having unique differences at four 
sites from all other haplotypes sampled. The distribution of haplotype 1 also indicates population 
differentiation at this locus with a >20 fold lower frequency in Caucasians compared, to the other 
groups. Also, haplotype 6 is more common in African-Americans and Asians compared to the. 
other two groups. Li contrast to the above, the frequency of haplotype 4 is similar in all groups. 
Assigning haplotypes from unphased genotype data from 200 individuals using the above 
derivation protocol gave the same results as molecular haplotyping except in a single subject due 
to a discrepancy at one SNP position (data not shown). 

Example 3 

This example illustrates phylogenetic analysis and linkage disequilibrium analysis of the 
individual p 2 AR haplotypes shown in Table 5. 
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This analysis used a variation of the minimal spanning network algorithm (Excoffier, L., 
et al., (1992) Genetics 131, 479-491.). An advantage of this algorithm over other methods is that 
it does not force a strictly bifurcating tree as a result. Thus, actual reticulations in.the "tree", 
such as those arising from evolutionary recombination among the haplotypes, can be visualized 
and interpreted. With this approach every haplotype is connected to the haplotype(s) most 
similar to itself and the results are shown in Figure 3. 

This phylo genetic analysis of the P2AR haplotypes revealed a deep divergence of 
haplotype 2 as well as potential evolutionary recombination. events. Haplotype 12 appears to be a 
recombinant between haplotypes 1 and 2. Also, haplotype 8 is best explained as a recombination 
between the highly frequent haplotypes 4 and 6. 

Linkage disequilibrium (A) for our largest sample, Caucasians, was quantitated using 
standard methodology (Hill, W.G., et al., Theor Appl Genet 38, 226-23 1 ; Hill, W.G, & Weir, 
B.S. (1994) Am J Hum Genet 54, 705-714) and the results are shown in Figure 4. Although 
many of the sites are in strong disequilibrium, it is clear that some pairs of close sites have 
reduced levels of linkage disequilibrium relative to more distantly spaced pairs of sites. This 
illustrates the hazards inherent in randomly selecting an individual SNP as a surrogate marker. . 
Furthermore, no individual SNP adequately predicted these complex haplotypes. , 

Example 4 

This example illustrates analysis of the P2AR haplotypes in Table 5 for association with 
asthma patients 5 response to albuterol. 

The patients in- the asthma cohort were enrolled from an outpatient facility as described in 
detail elsewhere (Yan, L., Galinsky, R.E., Bernstein, J.A., Liggett, S.B. & Weinshilboum, R.M. 
(1999) Pharmacogenetics in Press). Patients underwent spirometry before and 30 min after 
inhalation of 1 80 |ag albuterol delivered by nebulization. Forced expiratory volume in 1 second 
(FEVi) and forced vital capacity (FVC) were determined in triplicate. The predicted values for 
these measurements were calculated based on standard algorithms (Morris, J.F., Koski, A. & 
Johnson, L.C. (1971) Am Rev Respir Dis 103, 57-67). The change in the % predicted FEVi was 
considered the primary measure of responsiveness to albuterol (Dales, R.E., Spitzer, W.O., 
Tousignant, P., Schechter, M. & Suissa, S. (1988) Am Rev Respir Dis 138, 317-320). 

In the Caucasian members of the asthmatic cohort, the two rare SNPs at -709 and -406 
were not found. (For purposes of consistency the haplotypes discussed below continue to list 
these positions although they were invariant in the Caucasian population group in this cohort.) 
No other differences were found in the frequencies of the haplotypes between the Index 
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Repository and the asthmatic population. The haplotypes were assembled as pairs, and the 
eighteen haplotype pairs that were found in the asthmatic cohort are shown in Table 6 below. 



Table 6. Haplotype Pairs Observed in Asthma Patients 



Hap 






Chromosome A Polymorphic Sites 










Chromosome B Polymorphic Sites 








Pair 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 




1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


N 


% 


2/4 


A 


C 


G 


G 


C 


c 


C 


C 


G 


G 


G 


C 


c 




G 


C 


A 


C 


c 


T 


T 


T 


A 


C 


G 


c 


C 


37 


30.6 


2/2 


A 


C 


G 


G 


C 


c 


C 


C 


G 


G 


G 


C 


c 




A 


C 


G 


G 


c 


c 


C 


C 


G 


G 


G 


c 


C 


25 


20.7 


2/6 


A 


C 


G 


G 


c 


c 


c 


c 


G 


G 


G 


C 


c 




G 


C 


G 


C 


c 


T 


T 


T 


G 


C 


A 


c 


A 


22 


18.2 


4/4 


G 


C 


A 


C 


c 


T 


T 


T 


A 


C 


G 


C 


c 




G 


C 


A 


C 


c 


T 


T 


T 


A 


C 


G 


C 


C 


14 


11.6 


4/6 


G 


c 


A 


C 


c 


T 


T 


T 


A 


C 


G 


C 


c 




G 


C 


G 


c 


c 


T 


T 


T 


G 


C 


A 


c 


A 


8 


6.6 


2/5 


A 


c 


G 


G 


c 


C 


C 


C 


G 


G 


G 


c 


c 




G 


C 


A 


c 


c 


T 


T 


T 


G 


C 


G 


c 


C 


2 


1.7 


4/10 


G 


c 


A 


C 


c 


T 


T 


T 


A 


C 


G 


c 


c 




G 


C 


G 


c 


c 


T 


T 


T 


G 


c 


A 


c 


C 


2 


1.7 


•1/4 


A 


c 


G 


C 


c 


T 


T 


T 


A 


C 


G 


c 


c 




G 


C 


A- 


c 


c 


T 


T 


T 


A 


c 


G 


c 


C 


1 


0.8 


1/6 


A 


c 


G 


C 


c 


T 


T 


T 


A 


C 


G 


c 


c 




G 


C 


G 


c 


c 


T 


T 


T 


G 


c 


A 


c 


A 


1 


0.8 


2/11 


A 


c 


G 


G 


c 


C 


C 


C 


G 


G 


G 


c 


c 




G 


C 


G 


c 


c 


T 


T 


T 


G 


c 


G 


c 


C 


1 


0.8 


2/3 


A 


c 


G 


G 


c 


C 


C 


C 


G 


G 


G 


c 


c 




G 


A 


A 


c 


c 


T. 


T 


T 


A 


c 


G 


C 


C 


1 


0.8 


2/7 


A 


c 


G 


G 


c 


C 


C 


C 


G 


G 


G 


c 


c 




G 


C 


G 


c 


c 


T 


T 


T 


G 


c 


A 


T 


A 


1 


0.8 


2/8 


A 


c 


G 


G 


c 


C 


C 


C 


G 


G 


G 


c 


c 




G 


C 


A. 


c 


c 


T 


T 


T 


A 


c 


A 


C 


A 


1 


0.8 


3/4 


G 


A 


A 


C 


c 


T 


T 


T 


A 


C 


G 


c 


c 




G 


C 


A 


c 


c 


T 


T 


T 


A 


c 


G 


C 


C 


1 


0.8 


4/5 


G 


c 


A 


C 


c 


T 


T 


T 


A 


c 


G 


c 


c 




G 


C 


A 


c 


c 


T 


T 


T 


G 


c 


G 


C 


C 


1 


0.8 


4/7 


G 


c 


A 


C 


c 


T 


T 


T 


A 


c 


G 


c 


c 




G 


C 


G 


c 


c 


T 


T 


T 


G 


c 


A 


T 


A 


1 


0.8 


4/8' 


G 


c 


A 


C 


c 


T 


T 


T 


A 


c 


G 


c 


c 




G 


C 


A 


c 


c 


T 


T 


T 


A 


c 


A 


C 


A 


1 


0.8 


6/7 


G 


c 


G 


C 


c 


T 


T 


T 


G 


c 


A 


c 


A 




G 


C 


G 


c 


c 


T 


T 


T 


G 


c 


A 


T 


A 


.1 


0.8 
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Eighty-seven percent of the asthmatic cohort are represented by the five most common 
haplotype pairs. The association between changes in % predicted FEVi and haplotype pair was 
assessed by fitting an analysis of covariance model (ANCOVA) with terms for haplotype pair, 
sex and baseline severity. P-values from pair/wise comparisons by haplotype pair were adjusted 

10 for multiple comparisons by applying the Holm-Sidak step-down procedure (Ludbrook, J. (1998) 
Clin and Exp Pharmacology & Physiology 25, 1032-1037.). Haplotypes observed in <1% of the 
cohort that were single nucleotide derivatives of another, more frequent, haplotype were 
collapsed into the more frequent haplotype if the single nucleotide difference was unique to the 
rare haplotype. And, for purposes of analysis, the final data set excluded haplotype pairs that 

15 were observed in <5% of the cohort. In addition, the African- American asthma patients were 
excluded from the analysis due to the low number of individuals in this population group. For 
the analysis of individual SNPs, a similar ANCOVA model was employed with a discrete term 
for SNP genotype, and terms for sex and baseline severity. The Holm-Sidak step-down 
procedure was used to adjust the p-values from the individual SNP analyses for the number of 

20 tests performed. 

The responsiveness to the P-agonist albuterol for Caucasian individuals with the five 
common haplotype pairs is shown in Figure 5. Haplotype pair was significantly related to 
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improvements in % FEVi (p value=0.007 from ANCOVA). To delineate which pairs differ from 
one another comparisons were made for haplotype pair 4/4 (which had the lowest response) and 
haplotype pair 4/6 (which had the highest response), versus the other haplotype pairs. Pairwise 
tests were made while correcting for multiple comparisons. These results showed that the 
differences of the responses of those with haplotype pairs 4/4 and 4/6 were highly significant 
(change % FEVi = 8.53 ± 1.78 vs 19.1 ± 2.79, p=0.008). Differences were also found between 
those with pairs 2/4 vs 4/6, and 2/2 vs 4/4 (p=0.036 and 0.046, respectively). In contrast to these 
results with haplotypes, we found no association between the response to. albuterol and any 
individual SNP. For this analysis, a similar ANCOVA model as above was utilized. The p 
values for each SNP were all substantially >0.05 (adjusted for multiple comparisons). Based on 
this in vivo data, it appears that haplotype 4 is associated with depressed responsiveness and 
haplotype 2 with increased responsiveness. Since 0, 1 or 2 copies of these two haplotypes are 
present in our population as haplotype pairs 2/2, 2/4 and 4/4, a potential gene dose effect can be 
assessed by regression analysis. Such an analysis indeed showed a significant relationship 
between copy number of haplotype 2 (or 4) and the response to albuterol (p=0.009). 

The report by Martinez et al, supra, examined response to albuterol and individual 
polymorphisms at position 46 (Gly or Argl6) or 79 (Glu or Gln27) in a group consisting of 
normal, "wheezy", and asthmatic children (10.8 ± 0.6 years of age). One hundred eighty eight 
subjects had both parents being Caucasian, 40 had one parent being Hispanic and. 41 had both 
parents being Hispanic. Only 14% of the subj ects had asthma. This study reports an association 
(p=0.05 for trend) between the Argl6 allele (adenine at nucleic acid 46, PS9 of Table 3) and the 
prevalence of bronchodilator responsiveness in asthmatics. This analysis did not utilize 
haplotypes at the two loci. However, this previous finding with the individual polymorphism at 
PS9 is opposite to what the inventors herein found regarding this site within haplotype pairs 2/2 
and 4/4. That is, within these Haplotypes guanine at PS9 is associated with the greater 
bronchodilator response to albuterol, not adenine. Moreover, the very small number of 
homozygous Argl6 asthmatics (5) who had a positive bronchodilator response, the p value of 
0.05, the potential confoundment of race, and the use of mild pediatric asthmatics, makes the 
Martinez et al. study incomparable to the inventor's study described herein, which utilized 
multisite haplotypes, a greater number of asthmatics, and adult Caucasian subjects having a range 
of asthma severity. Thus, those skilled in the art would not be able to predict from the Martinez 
et al. study the results of the haplotype study discussed herein. 
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Two other studies reported limited (2 site) P2AR haplotypes and some association with 
asthma, neither of which were related to the bronchodilator response to albuterol. In one study 
(Weir et al., Am JResp Crit Care Med 158:787-791, 1998) the relationship between haplotype 
encompassing only PS9 and PS 10 and asthma death/near death and severity was examined. 

5 Death/near death was not associated with a haplotype, although the more moderate asthmatics 
were more likely to have the Glyl6/Gln27 haplotype than mild asthmatics. Those skilled in the 
art would not be able to predict from this study of two polymorphic sites pur results involving 
multisite haplotypes and the bronchodilating response to albuterol. In another study (D'Amato et 
al., Am JResp Crit Care Med 158:1968-1973, 1998) haplotypes at PS9 and PS10 were examined 

10 for any relationship to bronchial hyperreactivity. Bronchial hyperreactivity is the constriction of 
the airways during inhalation of certain bronchospastic agents such as methacholine. It is not a 
test for the relaxation (bronchodilating) response to a p-agonist such as albuterol. In D'Amato et 
al., the Glyl6/Glyn27 haplotype was associated with bronchial hyperreactivity to the inhaled 
bronchoconstrictor methacholine. Those skilled in the ait would not be able to predict from this 

15 prior study the relationship between a 13 site p 2 AR haplotype and the response to the 
bronchodilator albuterol. 

The inventors also performed regression calculations to determine the minimal number of 
polymorphisms that predict association between P2AR polymorphisms and response to albuterol. 
The regression placed individuals in the clinical cohort into groups having 0, 1, or 2 copies of a 

20 haplotype or SNP. It then calculated the significance (reported as a p-value) of the regression line 
connecting number of copies of the haplotype with response (change in % FEV1). Each site was 
tested independently to find even marginal association (p<0.1). Any pair, triple, etc. of sites that 
showed this nominal association were further combined into haplotypes (sub-haplotypes). A p- 
value was calculated for the association between each sub-haplotype with response/non-response. 

25 A haplotype containing n sites was discarded if there was an m-site haplotype with a greater 
association (smaller p-value) if m<n. In other words, we favor the. simplest association or 
explanation of the effect, and reject more complex associations. The most significant association 
was found with the 3-site haplotype comprised of polymorphic sites 3, 9, and 1 1 . 

Example 5 

30 This example illustrates analysis of the amount of in vitro expression of two of the 

predictive P2AR haplotype pairs. 

The two homozygous hapiotype pairs, 4/4 and 2/2, were common in the asthmatic cohort 
and displayed significant differences in the in vivo response to the agonist albuterol. To 
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determine whether the SNPs within these two haplotypes result in different levels of P2AR 
mRNA- or protein expression, transfection studies were carried out in the human embryonic 
kidney cell line HEK293. These cells were chosen because of their high transfection efficiency, 
their human origin, and the fact that they express P2AR (~10 finol/mg) and thus presumably have 
the relevant transcription factors for expression of the human gene. PCR products of the P2AR 
gene representing haplotypes 2 and 4 were derived from human genomic DNA, subcloned into 
pCR2.1 and the sequence verified. The pCR2.1 vector lacks a eukaryotic responsive promoter 
and thus the expression of the p 2 AR gene in mammalian cells is directed by the included p 2 AR 
promoter sequence. HEK293 cells were transfected using a lipofectamine method (Gene 
Therapy Systems, San Diego, CA). Cells were transfected with 10 jig p 2 AR plasmid, 2 jxg 
luciferase plasmid, and 50 jxl of lipofectamine reagent. Two days later cells were harvested for 
radioligand binding, mRNA studies, and luciferase activity as previously described (McGraw, 
D.W., Forbes, S.L., Kramer, L.A. & Liggett, S.B. (1998) J Clin Invest 102, 1927-1932.). Briefly, 
cells were washed three times in phosphate buffered saline, lysed in hypotonic 5 mM Tris, 2mM 
EDTA pH 7.40 buffer, and the particulates centrifuged at 40,000 xg for 1 0 min. Receptor 
expression was determined by radioligand binding using 400 pM l25 I-cyanopindolol ( 125 I-CYP). 
Non-specific binding was determined in the presence of 1 \\M propranolol. As might be 
expected, the levels of expression using the P2AR promoter were significantly less than what has 
previously been reported with viral promoters (Tepe, N.M. & Liggett, S.B. (2000) J Receptor & 
Signal Transduction Res 20, 75-85). Nevertheless, the levels obtained (-100 finol/mg receptor 
by radioligand binding) were clearly above background and in fact are similar to P2AR 
expression in the lung (Green, S.A., Turki, J., Bejarano, P., Hall, LP. & Liggett, S.B. (1995) Am 
JResp CellMol Biol 13, 25-33). Luciferase activity of cell lysates was determined using a 
commercial assay (Promega) and was used to control for minor differences in transfection 
efficiency from plate to plate. P2AR density is thus expressed as finol/mg membrane protein or 
finol/light unit (finol/LU) as previously described (McGraw et al., supra). mRNA levels were 
determined using ribonuc lease protection assays with a 563 bp antisense riboprobe corresponding 
to the most 3' region of the P2AR ORF as described previously (McGraw, D.W., Forbes, S.L., 
Witte, D.P., Fortner, C.N., Paul, R.L & Liggett, S.B. (1999) J Biol Chem 274, 32241-32247). p- 
actin mRNA was simultaneously quantitated confirming the equivalent loading of the samples. 
The results of these experiments are shown in Figure 6. 
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As shown in Figure 6A, the level of expression was clearly different between P2AR 
isogenes defined by haplotypes 2 and 4. When the construct defined by haplotype 2 was utilized, 
p 2 AR expression was 144 i 12.8 fmol/mg compared to 93.6 ± 5.7 fmol/mg when the haplotype 4 
construct was used (p<0.005). When corrected for transfection efficiency by quantitating 
luciferase activity derived from co-expression of a luciferase construct, the differences in 
expression of the P2AR (finol/LU) between the two haplotypes remained (Figure 6B). Similarly 
the P2AR mRNA levels as determined by quantitative RNAse protection assays were consistently 
higher for the haplotype 2-transfected cells than P2AR mRNA levels in the haplotype 4- 
transfected cells (0.663 ± 0.067 vs 0.320 ± 0.024 arbitrary units, p<0.005) (Figure 6C). The 
above results for both protein and mRNA expression are entirely consistent with the in vivo 
findings, where individuals with haplotype pair 2/2 had a —50% greater response than those with 
haplotype pair 4/4 (Figure 5). 

Comparisons of the sequence of haplotypes 2 and 4 reveal eight differences in the thirteen - 
SNP positions. These include differences in amino acid 19 of the BUP, and amino acids 16 and 
27 of the receptor protein. Each of these, studied in isolation, have been shown to alter 
expression or trafficking of the receptor (Green, S., Turki, J., Innis, M. & Liggett, S.B. (1994) _ 
Biochem 33, 9414-9419; McGraw, D.W., Forbes, S.L., Kramer, L.A. & Liggett, S.B. (1998) J 
Clin Invest 102, 1927-1932), but the effects of the various SNP combinations at these loci have 
not previously been explored. Interestingly, based on previous, work with the BUP SNP (PS7) 
studied in isolation, the skilled artisan would have predicted that the T allele would be associated 
with higher expression. This was evaluated, however, within the context of the Glyl6 (G at PS9) 
and Glu27 (G at PS 10) alleles, which as shown in Table 5 were never found in combination with 
T at PS7. ; This emphasizes the importance of studying polymorphisms in vitro within the context 
of a validated haplotype. 

The SNPs at the other five loci that differ. between haplotypes 2 and 4 are at PS1, PS3, 
PS4, PS6 and PS8. A database search for transcription factor binding sites (Heinemeyer, T., 
Chen, X., Karas, H., Kel, A.E., Kel, O.V., Liebich, I, Meinhardt, T., Reuter, I., Schacherer, F. & 
Wingender, E. (1999) Nucleic Acids Res 27, 318-322) shows that these SNPs are located within, 
or closely flank, a number of potential cw-acting elements. For example, PS1 flanks, potential 
binding sites for AP-4 and C/EBP; PS3 and PS4 each flank an NF-1 consensus sequence; and PS 
6 is within a CP2 consensus sequence. Of note, the synonymous SNP at PS 13. which has been 
associated with altered responsiveness to albuterol in Japanese families (Ohe, M., Munakata, M., 
Hizawa, N., Itph, A., Doi, I, Yamaguchi, E., Homma, Y. & Kawakami, Y. (1995) Thorax 50, 
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353-359.) was invariant between haplotypes 2 and 4. However, this SNP along with the SNP at 
PS1 1 distinguishes one common haplotype (haplotype 6) from the other common haplotypes. 
And, haplotype 6 appears to have some effect on response (Figure 5). 

Whether a smaller subset of P2AR SNPs defines the cellular expression phenotype cannot 
be ascertained from the current molecular approach since this would require systematic 
construction of vectors representing many unique haplotypes. Since a large fraction of these 
would in fact be rare (or never found) in the human population, the inventors have taken the 
approach of restricting our examination to the common haplotypes, since ultimately these are 
most relevant to pharmacogenetics. However, based on the results of the current in vivo 
responsiveness studies, the cell transfection experiments, and previous studies with isolated 
SNPs, it is likely that the biologic phenotype is directed by an interaction involving transcription, 
translation and protein processing that ultimately defines the effect of these haplotypes. 

In summary, the inventors identified thirteen polymorphic sites in a contiguous region of 
the 5' upstream and coding sequence of the P2AR in humans. Twelve distinct haplotypes were 
represented in a population of four major ethnic groups. Several relatively recent recombination 
events appear to be responsible for some haplotypes. A striking divergence in ethnic distribution 
was found for several haplotypes. Five haplotype pairs were common in asthmatics, and there 
were clear differences in the in vivo response to a P2AR agonist based on haplotype pair. In 
contrast, no isolated SNP had any predictive utility. The homozygous haplotypes 2/2 and 4/4 
with divergent agonist efficacies were shown to have differential effects on P2AR gene and 
protein expression in vitro, consistent with the direction and magnitude of the in vivo responses. 

In view of the above, it will be seen that the several advantages of the . invention are 
achieved and other advantageous results attained. 

As various changes could be made in the above methods and compositions without 
departing from the scope of the invention, it is intended that all matter contained in the above 
description and shown in the accompanying drawings shall be interpreted as illustrative and not 
in a limiting, sense. 

All references cited in this specification, including patents and patent applications, are 
hereby incorporated in their entirety by reference. The discussion of references herein is 
intended merely to summarize the assertions made by their authors and no admission is made that 
any reference constitutes prior art. Applicants reserve the right to challenge the accuracy and 
pertinency of the cited references. 
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